(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2506 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

ATGAGGGGGA TGCTGCCCCT CTTTGAGCCC AAGGGCCGGG TCCTCCTGGT GGACGGCCAC 60 

CACCTGGCCT ACCGCACCTT CCACGCCCTG AAGGGCCTCA CCACCAGCCG GGGGGAGCCG 120

GTGCAGGCGG TCTACGGCTT CGCCAAGAGC CTCCTCAAGG CCCTCAAGGA GGACGGGGAC 180 

GCGGTGATCG TGGTCTTTGA CGCCAAGGCC CCCTCCTTCC GCCACGAGGC CTACGGGGGG 240 

TACAAGGCGG GCCGGGCCCC CACGCCGGAG GACTTTCCCC GGCAACTCGC CCTCATCAAG 300

GAGCTGGTGG ACCTCCTGGG GCTGGCGCGC CTCGAGGTCC CGGGCTACGA GGCGGACGAC 360

GTCCTGGCCA GCCTGGCCAA GAAGGCGGAA AAGGAGGGCT ACGAGGTCCG CATCCTCACC 420

GCCGACAAAG ACCTTTACCA GCTCCTTTCC GACCGCATCC ACGTCCTCCA CCCCGAGGGG 480

TACCTCATCA CCCCGGCCTG GCTTTGGGAA AAGTACGGCC TGAGGCCCGA CCAGTGGGCC 540 

GACTACCGGG CCCTGACCGG GGACGAGTCC GACAACCTTC CCGGGGTCAA GGGCATCGGG 600 

GAGAAGACGG CGAGGAAGCT TCTGGAGGAG TGGGGGAGCC TGGAAGCCCT CCTCAAGAAC 660 

CTGGACCGGC TGAAGCCCGC CATCCGGGAG AAGATCCTGG CCCACATGGA CGATCTGAAG 720

CTCTCCTGGG ACCTGGCCAA GGTGCGCACC GACCTGCCCC TGGAGGTGGA CTTCGCCAAA 780

AGGCGGGAGC CCGACCGGGA GAGGCTTAGG GCCTTTCTGG AGAGGCTTGA GTTTGGCAGC 840 

CTCCTCCACG AGTTCGGCCT TCTGGAAAGC CCCAAGGCCC TGGAGGAGGC CCCCTGGCCC 900 

CCGCCGGAAG GGGCCTTCGT GGGCTTTGTG CTTTCCCGCA AGGAGCCCAT GTGGGCCGAT 960 

CTTCTGGCCC TGGCCGCCGC CAGGGGGGGC CGGGTCCACC GGGCCCCCGA GCCTTATAAA 1020

GCCCTCAGGG ACCTGAAGGA GGCGCGGGGG CTTCTCGCCA AAGACCTGAG CGTTCTGGCC 1080 

CTGAGGGAAG GCCTTGGCCT CCCGCCCGGC GACGACCCCA TGCTCCTCGC CTACCTCCTG 1140 

GACCCTTCCA ACACCACCCC CGAGGGGGTG GCCCGGCGCT ACGGCGGGGA GTGGACGGAG 1200 

GAGGCGGGGG AGCGGGCCGC CCTTTCCGAG AGGCTCTTCG CCAACCTGTG GGGGAGGCTT 1260 

GAGGGGGAGG AGAGGCTCCT TTGGCTTTAC CGGGAGGTGG AGAGGCCCCT TTCCGCTGTC 1320 

CTGGCCCACA TGGAGGCCAC GGGGGTGCGC CTGGACGTGG CCTATCTCAG GGCCTTGTCC 1380 

CTGGAGGTGG CCGAGGAGAT CGCCCGCCTC GAGGCCGAGG TCTTCCGCCT GGCCGGCCAC 1440 

CCCTTCAACC TCAACTCCCG GGACCAGCTG GAAAGGGTCC TCTTTGACGA GCTAGGGCTT 1500 

CCCGCCATCG GCAAGACGGA GAAGACCGGC AAGCGCTCCA CCAGCGCCGC CGTCCTGGAG 1560

GCCCTCCGCG AGGCCCACCC CATCGTGGAG AAGATCCTGC AGTACCGGGA GCTCACCAAG 1620

CTGAAGAGCA CCTACATTGA CCCCTTGCCG GACCTCATCC ACCCCAGGAC GGGCCGCCTC 1680 

CACACCCGCT TCAACCAGAC GGCCACGGCC ACGGGCAGGC TAAGTAGCTC CGATCCCAAC 1740

CTCCAGAACA TCCCCGTCCG CACCCCGCTT GGGCAGAGGA TCCGCCGGGC CTTCATCGCC 1800

GAGGAGGGGT GGCTATTGGT GGCCCTGGAC TATAGCCAGA TAGAGCTCAG GGTGCTGGCC 1860 

CACCTCTCCG GCGACGAGAA CCTGATCCGG GTCTTCCAGG AGGGGCGGGA CATCCACACG 1920

GAGACCGCCA GCTGGATGTT CGGCGTCCCC CGGGAGGCCG TGGACCCCCT GATGCGCCGG 1980 

GCGGCCAAGA CCATCAACTT CGGGGTCCTC TACGGCATGT CGGCCCACCG CCTCTCCCAG 2040

GAGCTAGCCA TCCCTTACGA GGAGGCCCAG GCCTTCATTG AGCGCTACTT TCAGAGCTTC 2100

CCCAAGGTGC GGGCCTGGAT TGAGAAGACC CTGGAGGAGG GCAGGAGGCG GGGGTACGTG 2160

GAGACCCTCT TCGGCCGCCG CCGCTACGTG CCAGACCTAG AGGCCCGGGT GAAGAGCGTG 2220

CGGGAGGCGG CCGAGCGCAT GGCCTTCAAC ATGCCCGTCC AGGGCACCGC CGCCGACCTC 2280 

ATGAAGCTGG CTATGGTGAA GCTCTTCCCC AGGCTGGAGG AAATGGGGGC CAGGATGCTC 2340 

CTTCAGGTCC ACGACGAGCT GGTCCTCGAG GCCCCAAAAG AGAGGGCGGA GGCCGTGGCC 2400

CGGCTGGCCA AGGAGGTCAT GGAGGGGGTG TATCCCCTGG CCGTGCCCCT GGAGGTGGAG 2460 

GTGGGGATAG GGGAGGACTG GCTCTCCGCC AAGGAGTGAT ACCACC 2506 
(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2496 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

ATGGCGATGC TTCCCCTCTT TGAGCCCAAA GGCCGCGTGC TCCTGGTGGA CGGCCACCAC 60 

CTGGCCTACC GCACCTTCTT TGCCCTCAAG GGCCTCACCA CCAGCCGCGG CGAACCCGTT 120 

CAGGCGGTCT ACGGCTTCGC CAAAAGCCTC CTCAAGGCCC TGAAGGAGGA CGGGGACGTG 180 

GTGGTGGTGG TCTTTGACGC CAAGGCCCCC TCCTTCCGCC ACGAGGCCTA CGAGGCCTAC 240 

AAGGCGGGCC GGGCCCCCAC CCCGGAGGAC TTTCCCCGGC AGCTGGCCCT CATCAAGGAG 300

TTGGTGGACC TCCTAGGCCT TGTGCGGCTG GAGGTTCCCG GCTTTGAGGC GGACGACGTG 360 

CTGGCCACCC TGGCCAAGCG GGCGGAAAAG GAGGGGTACG AGGTGCGCAT CCTCACTGCC 420

GACCGCGACC TCTACCAGCT CCTTTCGGAG CGCATCGCCA TCCTCCACCC TGAGGGGTAC 480

CTGATCACCC CGGCGTGGCT TTACGAGAAG TACGGCCTGC GCCCGGAGCA GTGGGTGGAC 540 

TACCGGGCCC TGGCGGGGGA CCCCTCGGAT AACATCCCCG GGGTGAAGGG CATCGGGGAG 600 

AAGACCGCCC AGAGGCTCAT CCGCGAGTGG GGGAGCCTGG AAAACCTCTT CCAGCACCTG 660 

GACCAGGTGA AGCCCTCCTT GCGGGAGAAG CTCCAGGCGG GCATGGAGGC CCTGGCCCTT 720 

TCCCGGAAGC TTTCCCAGGT GCACACTGAC CTGCCCCTGG AGGTGGACTT CGGGAGGCGC 780 

CGCACACCCA ACCTGGAGGG TCTGCGGGCT TTTTTGGAGC GGTTGGAGTT TGGAAGCCTC 840 

CTCCACGAGT TCGGCCTCCT GGAGGGGCCG AAGGCGGCAG AGGAGGCCCC CTGGCCCCCT 900 

CCGGAAGGGG CTTTTTTGGG CTTTTCCTTT TCCCGTCCCG AGCCCATGTG GGCCGAGCTT 960 

CTGGCCCTGG CTGGGGCGTG GGAGGGGCGC CTCCATCGGG CACAAGACCC CCTTAGGGGC 1020

CTGAGGGACC TTAAGGGGGT GCGGGGAATC CTGGCCAAGG ACCTGGCGGT TTTGGCCCTG 1080

CGGGAGGGCC TGGACCTCTT CCCAGAGGAC GACCCCATGC TCCTGGCCTA CCTTCTGGAC 1140

CCCTCCAACA CCACCCCTGA GGGGGTGGCC CGGCGTTACG GGGGGGAGTG GACGGAGGAT 1200

GCGGGGGAGA GGGCCCTCCT GGCCGAGCGC CTCTTCCAGA CCCTAAAGGA GCGCCTTAAG 1260

GGAGAAGAAC GCCTGCTTTG GCTTTACGAG GAGGTGGAGA AGCCGCTTTC CCGGGTGTTG 1320

GCCCGGATGG AGGCCACGGG GGTCCGGCTG GACGTGGCCT ACCTCCAGGC CCTCTCCCTG 1380

GAGGTGGAGG CGGAGGTGCG CCAGCTGGAG GAGGAGGTCT TCCGCCTGGC CGGCCACCCC 1440

TTCAACCTCA ACTCCCGCGA CCAGCTGGAG CGGGTGCTCT TTGACGAGCT GGGCCTGCCT 1500

GCCATCGGCA AGACGGAGAA GACGGGGAAA CGCTCCACCA GCGCTGCCGT GCTGGAGGCC 1560

CTGCGAGAGG CCCACCCCAT CGTGGACCGC ATCCTGCAGT ACCGGGAGCT CACCAAGCTC 1620

AAGAACACCT ACATAGACCC CCTGCCCGCC CTGGTCCACC CCAAGACCGG CCGGCTCCAC 1680

ACCCGCTTCA ACCAGACGGC CACCGCCACG GGCAGGCTTT CCAGCTCCGA CCCCAACCTG 1740

CAGAACATCC CCGTGCGCAC CCCTCTGGGC CAGCGCATCC GCCGAGCCTT CGTGGCCGAG 1800

GAGGGCTGGG TGCTGGTGGT CTTGGACTAC AGCCAGATTG AGCTTCGGGT CCTGGCCCAC 1860

CTCTCCGGGG ACGAGAACCT GATCCGGGTC TTTCAGGAGG GGAGGGACAT CCACACCCAG 1920

ACCGCCAGCT GGATGTTCGG CGTTTCCCCC GAAGGGGTAG ACCCTCTGAT GCGCCGGGCG 1980

GCCAAGACCA TCAACTTCGG GGTGCTCTAC GGCATGTCCG CCCACCGCCT CTCCGGGGAG 2040

CTTTCCATCC CCTACGAGGA GGCGGTGGCC TTCATTGAGC GCTACTTCCA GAGCTACCCC 2100

AAGGTGCGGG CCTGGATTGA GGGGACCCTC GAGGAGGGCC GCCGGCGGGG GTATGTGGAG 2160

ACCCTCTTCG GCCGCCGGCG CTATGTGCCC GACCTCAACG CCCGGGTGAA GAGCGTGCGC 2220

GAGGCGGCGG AGCGCATGGC CTTCAACATG CCGGTCCAGG GCACCGCCGC CGACCTCATG 2280

AAGCTGGCCA TGGTGCGGCT TTTCCCCCGG CTTCAGGAAC TGGGGGCGAG GATGCTTTTG 2340

CAGGTGCACG ACGAGCTGGT CCTCGAGGCC CCCAAGGACC GGGCGGAGAG GGTAGCCGCT 2400

TTGGCCAAGG AGGTCATGGA GGGGGTCTGG CCCCTGCAGG TGCCCCTGGA GGTGGAGGTG 2460

GGCCTGGGGG AGGACTGGCT CTCCGCCAAG GAGTAG 2496


(2) INFORMATION FOR SEQ ID NO : 3 : 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:



ATGGAGGCGA TGCTTCCGCT CTTTGAACCC AAAGGCCGGG TCCTCCTGGT GGACGGCCAC 60

CACCTGGCCT ACCGCACCTT CTTCGCCCTG AAGGGCCTCA CCACGAGCCG GGGCGAACCG 120

GTGCAGGCGG TCTACGGCTT CGCCAAGAGC CTCCTCAAGG CCCTGAAGGA GGACGGGTAC 180

AAGGCCGTCT TCGTGGTCTT TGACGCCAAG GCCCCCTCCT TCCGCCACGA GGCCTACGAG 240

GCCTACAAGG CGGGGAGGGC CCCGACCCCC GAGGACTTCC CCCGGCAGCT CGCCCTCATC 300

AAGGAGCTGG TGGACCTCCT GGGGTTTACC CGCCTCGAGG TCCCCGGCTA CGAGGCGGAC 360

GACGTTCTCG CCACCCTGGC CAAGAAGGCG GAAAAGGAGG GGTACGAGGT GCGCATCCTC 420

ACCGCCGACC GCGACCTCTA CCAACTCGTC TCCGACCGCG TCGCCGTCCT CCACCCCGAG 480

GGCCACCTCA TCACCCCGGA GTGGCTTTGG GAGAAGTACG GCCTCAGGCC GGAGCAGTGG 540

GTGGACTTCC GCGCCCTCGT GGGGGACCCC TCCGACAACC TCCCCGGGGT CAAGGGCATC 600

GGGGAGAAGA CCGCCCTCAA GCTCCTCAAG GAGTGGGGAA GCCTGGAAAA CCTCCTCAAG 660

AACCTGGACC GGGTAAAGCC AGAAAACGTC CGGGAGAAGA TCAAGGCCCA CCTGGAAGAC 720

CTCAGGCTCT CCTTGGAGCT CTCCCGGGTG CGCACCGACC TCCCCCTGGA GGTGGACCTC 780

GCCCAGGGGC GGGAGCCCGA CCGGGAGGGG CTTAGGGCCT TCCTGGAGAG GCTGGAGTTC 840

GGCAGCCTCC TCCACGAGTT CGGCCTCCTG GAGGCCCCCG CCCCCCTGGA GGAGGCCCCC 900

TGGCCCCCGC CGGAAGGGGC CTTCGTGGGC TTCGTCCTCT CCCGCCCCGA GCCCATGTGG 960

GCGGAGCTTA AAGCCCTGGC CGCCTGCAGG GACGGCCGGG TGCACCGGGC AGCAGACCCC 1020

TTGGCGGGGC TAAAGGACCT CAAGGAGGTC CGGGGCCTCC TCGCCAAGGA CCTCGCCGTC 1080

TTGGCCTCGA GGGAGGGGCT AGACCTCGTG CCCGGGGACG ACCCCATGCT CCTCGCCTAC 1140

CTCCTGGACC CCTCCAACAC CACCCCCGAG GGGGTGGCGC GGCGCTACGG GGGGGAGTGG 1200

ACGGAGGACG CCGCCCACCG GGCCCTCCTC TCGGAGAGGC TCCATCGGAA CCTCCTTAAG 1260

CGCCTCGAGG GGGAGGAGAA GCTCCTTTGG CTCTACCACG AGGTGGAAAA GCCCCTCTCC 1320

CGGGTCCTGG CCCACATGGA GGCCACCGGG GTACGGCTGG ACGTGGCCTA CCTTCAGGCC 1380

CTTTCCCTGG AGCTTGCGGA GGAGATCCGC CGCCTCGAGG AGGAGGTCTT CCGCTTGGCG 1440

GGCCACCCCT TCAACCTCAA CTCCCGGGAC CAGCTGGAAA GGGTGCTCTT TGACGAGCTT 1500

AGGCTTCCCG CCTTGGGGAA GACGCAAAAG ACAGGCAAGC GCTCCACCAG CGCCGCGGTG 1560

CTGGAGGCCC TACGGGAGGC CCACCCCATC GTGGAGAAGA TCCTCCAGCA CCGGGAGCTC 1620

ACCAAGCTCA AGAACACCTA CGTGGACCCC CTCCCAAGCC TCGTCCACCC GAGGACGGGC 1680

CGCCTCCACA CCCGCTTCAA CCAGACGGCC ACGGCCACGG GGAGGCTTAG TAGCTCCGAC 1740

CCCAACCTGC AGAACATCCC CGTCCGCACC CCCTTGGGCC AGAGGATCCG CCGGGCCTTC 1800

GTGGCCGAGG CGGGTTGGGC GTTGGTGGCC CTGGACTATA GCCAGATAGA GCTCCGCGTC 1860

CTCGCCCACC TCTCCGGGGA CGAAAACCTG ATCAGGGTCT TCCAGGAGGG GAAGGACATC 1920

CACACCCAGA CCGCAAGCTG GATGTTCGGC GTCCCCCCGG AGGCCGTGGA CCCCCTGATG 1980

CGCCGGGCGG CCAAGACGGT GAACTTCGGC GTCCTCTACG GCATGTCCGC CCATAGGCTC 2040

TCCCAGGAGC TTGCCATCCC CTACGAGGAG GCGGTGGCCT TTATAGAGGC TACTTCCAAA 2100

GCTTCCCCAA GGTGCGGGCC TGGATAGAAA AGACCCTGGA GGAGGGGAGG AAGCGGGGCT 2160

ACGTGGAAAC CCTCTTCGGA AGAAGGCGCT ACGTGCCCGA CCTCAACGCC CGGGTGAAGA 2220

GCGTCAGGGA GGCCGCGGAG CGCATGGCCT TCAACATGCC CGTCCAGGGC ACCGCCGCCG 2280

ACCTCATGAA GCTCGCCATG GTGAAGCTCT TCCCCCGCCT CCGGGAGATG GGGGCCCGCA 2340

TGCTCCTCCA GGTCCACGAC GAGCTCCTCC TGGAGGCCCC CCAAGCGCGG GCCGAGGAGG 2400

TGGCGGCTTT GGCCAAGGAG GCCATGGAGA AGGCCTATCC CCTCGCCGTG CCCCTGGAGG 2460

TGGAGGTGGG GATGGGGGAG GACTGGCTTT CCGCCAAGGG TTAG 2504



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Arg Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu
1 5 10 15

Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys Gly
20 25 30

Leu Thr Thr Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly Phe Ala
35 40 45

Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val Ile Val
50 55 60

Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly Gly
65 70 75 80

Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gln Leu
85 90 95

Ala Leu Ile Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu Glu
100 105 110

Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys Lys
115 120 125

Ala Glu Lys Glu Gly Tyr Glu Val Arg Ile Leu Thr Ala Asp Lys Asp
130 135 140

Leu Tyr Gln Leu Leu Ser Asp Arg Ile His Val Leu His Pro Glu Gly
145 150 155 160

Tyr Leu Ile Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro
165 170 175

Asp Gln Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp Asn
180 185 190

Leu Pro Gly Val Lys Gly Ile Gly Glu Lys Thr Ala Arg Lys Leu Leu
195 200 205

Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu
210 215 220

Lys Pro Ala Ile Arg Glu Lys Ile Leu Ala His Met Asp Asp Leu Lys
225 230 235 240

Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu Val
245 250 255

Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala Phe
260 265 270

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu
275 280 285

Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly
290 295 300

Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala Asp
305 310 315 320

Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala Pro
325 330 335

Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu Leu
340 345 350

Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu Pro
355 360 365

Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn
370 375 380

Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu
385 390 395 400

Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn Leu
405 410 415

Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg Glu
420 425 430

Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr Gly
435 440 445

Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val Ala
450 455 460

Glu Glu Ile Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly His
465 470 475 480

Pro Phe Asn Leu Asn Ser Arg Asp Gln Leu Glu Arg Val Leu Phe Asp
485 490 495

Glu Leu Gly Leu Pro Ala Ile Gly Lys Thr Glu Lys Thr Gly Lys Arg
500 505 510

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro Ile
515 520 525

Val Glu Lys Ile Leu Gln Tyr Arg Glu Leu Thr Lys Leu Lys Ser Thr
530 535 540

Tyr Ile Asp Pro Leu Pro Asp Leu Ile His Pro Arg Thr Gly Arg Leu
545 550 555 560

His Thr Arg Phe Asn Gln Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser
565 570 575

Ser Asp Pro Asn Leu Gln Asn Ile Pro Val Arg Thr Pro Leu Gly Gln
580 585 590

Arg Ile Arg Arg Ala Phe Ile Ala Glu Glu Gly Trp Leu Leu Val Ala
595 600 605

Leu Asp Tyr Ser Gln Ile Glu Leu Arg Val Leu Ala His Leu Ser Gly
610 615 620

Asp Glu Asn Leu Ile Arg Val Phe Gln Glu Gly Arg Asp Ile His Thr
625 630 635 640

Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp Pro
645 650 655

Leu Met Arg Arg Ala Ala Lys Thr Ile Asn Phe Gly Val Leu Tyr Gly
660 665 670

Met Ser Ala His Arg Leu Ser Gln Glu Leu Ala Ile Pro Tyr Glu Glu
675 680 685

Ala Gln Ala Phe Ile Glu Arg Tyr Phe Gln Ser Phe Pro Lys Val Arg
690 695 700

Ala Trp Ile Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val
705 710 715 720

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala Arg
725 730 735

Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro
740 745 750

Val Gln Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu
755 760 765

Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gln Val His
770 775 780

Asp Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val Ala
785 790 795 800

Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val Pro
805 810 815

Leu Glu Val Glu Val Gly Ile Gly Glu Asp Trp Leu Ser Ala Lys Glu
820 825 830

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val
1 5 10 15

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu
20 25 30

Thr Thr Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly Phe Ala Lys
35 40 45

Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Val Val Val Val Val
50 55 60

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu Ala Tyr
65 70 75 80

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gln Leu Ala
85 90 95

Leu Ile Lys Glu Leu Val Asp Leu Leu Gly Leu Val Arg Leu Glu Val
100 105 110

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Arg Ala
115 120 125

Glu Lys Glu Gly Tyr Glu Val Arg Ile Leu Thr Ala Asp Arg Asp Leu
130 135 140

Tyr Gln Leu Leu Ser Glu Arg Ile Ala Ile Leu His Pro Glu Gly Tyr
145 150 155 160

Leu Ile Thr Pro Ala Trp Leu Tyr Glu Lys Tyr Gly Leu Arg Pro Glu
165 170 175

Gln Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn Ile
180 185 190

Pro Gly Val Lys Gly Ile Gly Glu Lys Thr Ala Gln Arg Leu Ile Arg
195 200 205

Glu Trp Gly Ser Leu Glu Asn Leu Phe Gln His Leu Asp Gln Val Lys
210 215 220

Pro Ser Leu Arg Glu Lys Leu Gln Ala Gly Met Glu Ala Leu Ala Leu
225 230 235 240

Ser Arg Lys Leu Ser Gln Val His Thr Asp Leu Pro Leu Glu Val Asp
245 250 255

Phe Gly Arg Arg Arg Thr Pro Asn Leu Glu Gly Leu Arg Ala Phe Leu
260 265 270

Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu Glu
275 280 285

Gly Pro Lys Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly Ala
290 295 300

Phe Leu Gly Phe Ser Phe Ser Arg Pro Glu Pro Met Trp Ala Glu Leu
305 310 315 320

Leu Ala Leu Ala Gly Ala Trp Glu Gly Arg Leu His Arg Ala Gln Asp
325 330 335

Pro Leu Arg Gly Leu Arg Asp Leu Lys Gly Val Arg Gly Ile Leu Ala
340 345 350

Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly Leu Asp Leu Phe Pro
355 360 365

Glu Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr
370 375 380

Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu Asp
385 390 395 400

Ala Gly Glu Arg Ala Leu Leu Ala Glu Arg Leu Phe Gln Thr Leu Lys
405 410 415

Glu Arg Leu Lys Gly Glu Glu Arg Leu Leu Trp Leu Tyr Glu Glu Val
420 425 430

Glu Lys Pro Leu Ser Arg Val Leu Ala Arg Met Glu Ala Thr Gly Val
435 440 445

Arg Leu Asp Val Ala Tyr Leu Gln Ala Leu Ser Leu Glu Val Glu Ala
450 455 460

Glu Val Arg Gln Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His Pro
465 470 475 480

Phe Asn Leu Asn Ser Arg Asp Gln Leu Glu Arg Val Leu Phe Asp Glu
485 490 495

Leu Gly Leu Pro Ala Ile Gly Lys Thr Glu Lys Thr Gly Lys Arg Ser
500 505 510

Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro Ile Val
515 520 525

Asp Arg Ile Leu Gln Tyr Arg Glu Leu Thr Lys Leu Lys Asn Thr Tyr
530 535 540

Ile Asp Pro Leu Pro Ala Leu Val His Pro Lys Thr Gly Arg Leu His
545 550 555 560

Thr Arg Phe Asn Gln Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser
565 570 575

Asp Pro Asn Leu Gln Asn Ile Pro Val Arg Thr Pro Leu Gly Gln Arg
580 585 590

Ile Arg Arg Ala Phe Val Ala Glu Glu Gly Trp Val Leu Val Val Leu
595 600 605

Asp Tyr Ser Gln Ile Glu Leu Arg Val Leu Ala His Leu Ser Gly Asp
610 615 620

Glu Asn Leu Ile Arg Val Phe Gln Glu Gly Arg Asp Ile His Thr Gln
625 630 635 640

Thr Ala Ser Trp Met Phe Gly Val Ser Pro Glu Gly Val Asp Pro Leu
645 650 655

Met Arg Arg Ala Ala Lys Thr Ile Asn Phe Gly Val Leu Tyr Gly Met
660 665 670

Ser Ala His Arg Leu Ser Gly Glu Leu Ser Ile Pro Tyr Glu Glu Ala
675 680 685

Val Ala Phe Ile Glu Arg Tyr Phe Gln Ser Tyr Pro Lys Val Arg Ala
690 695 700

Trp Ile Glu Gly Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val Glu
705 710 715 720

Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala Arg Val
725 730 735

Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro Val
740 745 750

Gln Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Arg Leu Phe
755 760 765

Pro Arg Leu Gln Glu Leu Gly Ala Arg Met Leu Leu Gln Val His Asp
770 775 780

Glu Leu Val Leu Glu Ala Pro Lys Asp Arg Ala Glu Arg Val Ala Ala
785 790 795 800

Leu Ala Lys Glu Val Met Glu Gly Val Trp Pro Leu Gln Val Pro Leu
805 810 815

Glu Val Glu Val Gly Leu Gly Glu Asp Trp Leu Ser Ala Lys Glu
820 825 830

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Met Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu
1 5 10 15

Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly
20 25 30

Leu Thr Thr Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly Phe Ala
35 40 45

Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala Val Phe
50 55 60

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu
65 70 75 80

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gln
85 90 95

Leu Ala Leu Ile Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu
100 105 110

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys
115 120 125

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg Ile Leu Thr Ala Asp Arg
130 135 140

Asp Leu Tyr Gln Leu Val Ser Asp Arg Val Ala Val Leu His Pro Glu
145 150 155 160

Gly His Leu Ile Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Arg
165 170 175

Pro Glu Gln Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro Ser Asp
180 185 190

Asn Leu Pro Gly Val Lys Gly Ile Gly Glu Lys Thr Ala Leu Lys Leu
195 200 205

Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu Asp Arg
210 215 220

Val Lys Pro Glu Asn Val Arg Glu Lys Ile Lys Ala His Leu Glu Asp
225 230 235 240

Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu Pro Leu
245 250 255

Glu Val Asp Leu Ala Gln Gly Arg Glu Pro Asp Arg Glu Gly Leu Arg
260 265 270

Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly
275 280 285

Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro Pro Pro
290 295 300

Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro Met Trp
305 310 315 320

Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Asp Gly Arg Val His Arg
325 330 335

Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly
340 345 350

Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp
355 360 365

Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro
370 375 380

Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp
385 390 395 400

Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg
405 410 415

Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr
420 425 430

His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala
435 440 445

Thr Gly Val Arg Leu Asp Val Ala Tyr Leu Gln Ala Leu Ser Leu Glu
450 455 460

Leu Ala Glu Glu Ile Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala
465 470 475 480

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gln Leu Glu Arg Val Leu
485 490 495

Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gln Lys Thr Gly
500 505 510

Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His
515 520 525

Pro Ile Val Glu Lys Ile Leu Gln His Arg Glu Leu Thr Lys Leu Lys
530 535 540

Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly
545 550 555 560

Arg Leu His Thr Arg Phe Asn Gln Thr Ala Thr Ala Thr Gly Arg Leu
565 570 575

Ser Ser Ser Asp Pro Asn Leu Gln Asn Ile Pro Val Arg Thr Pro Leu
580 585 590

Gly Gln Arg Ile Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu
595 600 605

Val Ala Leu Asp Tyr Ser Gln Ile Glu Leu Arg Val Leu Ala His Leu
610 615 620

Ser Gly Asp Glu Asn Leu Ile Arg Val Phe Gln Glu Gly Lys Asp Ile
625 630 635 640

His Thr Gln Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val
645 650 655

Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu
660 665 670

Tyr Gly Met Ser Ala His Arg Leu Ser Gln Glu Leu Ala Ile Pro Tyr
675 680 685

Glu Glu Ala Val Ala Phe Ile Glu Arg Tyr Phe Gln Ser Phe Pro Lys
690 695 700

Val Arg Ala Trp Ile Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly
705 710 715 720

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn
725 730 735

Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn
740 745 750

Met Pro Val Gln Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val
755 760 765

Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gln
770 775 780

Val His Asp Glu Leu Leu Leu Glu Ala Pro Gln Ala Arg Ala Glu Glu
785 790 795 800

Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala
805 810 815

Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala
820 825 830

Lys Gly



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2502 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 



ATGNNGGCGA TGCTTCCCCT CTTTGAGCCC AAAGGCCGGG TCCTCCTGGT GGACGGCCAC 60

CACCTGGCCT ACCGCACCTT CTTCGCCCTG AAGGGCCTCA CCACCAGCCG GGGCGAACCG 120

GTGCAGGCGG TCTACGGCTT CGCCAAGAGC CTCCTCAAGG CCCTGAAGGA GGACGGGGAC 180

NNGGCGGTGN TCGTGGTCTT TGACGCCAAG GCCCCCTCCT TCCGCCACGA GGCCTACGAG 240

GCCTACAAGG CGGGCCGGGC CCCCACCCCG GAGGACTTTC CCCGGCAGCT CGCCCTCATC 300

AAGGAGCTGG TGGACCTCCT GGGGCTTGCG CGCCTCGAGG TCCCCGGCTA CGAGGCGGAC 360

GACGTNCTGG CCACCCTGGC CAAGAAGGCG GAAAAGGAGG GGTACGAGGT GCGCATCCTC 420

ACCGCCGACC GCGACCTCTA CCAGCTCCTT TCCGACCGCA TCGCCGTCCT CCACCCCGAG 480

GGGTACCTCA TCACCCCGGC GTGGCTTTGG GAGAAGTACG GCCTGAGGCC GGAGCAGTGG 540 

GTGGACTACC GGGCCCTGGC GGGGGACCCC TCCGACAACC TCCCCGGGGT CAAGGGCATC 600 

GGGGAGAAGA CCGCCCNGAA GCTCCTCNAG GAGTGGGGGA GCCTGGAAAA CCTCCTCAAG 660 

AACCTGGACC GGGTGAAGCC CGCCNTCCGG GAGAAGATCC AGGCCCACAT GGANGACCTG 720

ANGCTCTCCT GGGAGCTNTC CCAGGTGCGC ACCGACCTGC CCCTGGAGGT GGACTTCGCC 780

AAGNGGCGGG AGCCCGACCG GGAGGGGCTT AGGGCCTTTC TGGAGAGGCT GGAGTTTGGC 840

AGCCTCCTCC ACGAGTTCGG CCTCCTGGAG GGCCCCAAGG CCCTGGAGGA GGCCCCCTGG 900

CCCCCGCCGG AAGGGGCCTT CGTGGGCTTT GTCCTTTCCC GCCCCGAGCC CATGTGGGCC 960

GAGCTTCTGG CCCTGGCCGC CGCCAGGGAG GGCCGGGTCC ACCGGGCACC AGACCCCTTT 1020

ANGGGCCTNA GGGACCTNAA GGAGGTGCGG GGNCTCCTCG CCAAGGACCT GGCCGTTTTG 1080 

GCCCTGAGGG AGGGCCTNGA CCTCNTGCCC GGGGACGACC CCATGCTCCT CGCCTACCTC 1140 

CTGGACCCCT CCAACACCAC CCCCGAGGGG GTGGCCCGGC GCTACGGGGG GGAGTGGACG 1200

GAGGANGCGG GGGAGCGGGC CCTCCTNTCC GAGAGGCTCT TCCNGAACCT NNNGCAGCGC 1260

CTTGAGGGGG AGGAGAGGCT CCTTTGGCTT TACCAGGAGG TGGAGAAGCC CCTTTCCCGG 1320

GTCCTGGCCC ACATGGAGGC CACGGGGGTN CGGCTGGACG TGGCCTACCT CCAGGCCCTN 1380

TCCCTGGAGG TGGCGGAGGA GATCCGCCGC CTCGAGGAGG AGGTCTTCCG CCTGGCCGGC 1440

CACCCCTTCA ACCTCAACTC CCGGGACCAG CTGGAAAGGG TGCTCTTTGA CGAGCTNGGG 1500 

CTTCCCGCCA TCGGCAAGAC GGAGAAGACN GGCAAGCGCT CCACCAGCGC CGCCGTGCTG 1560 

GAGGCCCTNC GNGAGGCCCA CCCCATCGTG GAGAAGATCC TGCAGTACCG GGAGCTCACC 1620

AAGCTCAAGA ACACCTACAT NGACCCCCTG CCNGNCCTCG TCCACCCCAG GACGGGCCGC 1680 

CTCCACACCC GCTTCAACCA GACGGCCACG GCCACGGGCA GGCTTAGTAG CTCCGACCCC 1740 

AACCTGCAGA ACATCCCCGT CCGCACCCCN CTGGGCCAGA GGATCCGCCG GGCCTTCGTG 1800

GCCGAGGAGG GNTGGGTGTT GGTGGCCCTG GACTATAGCC AGATAGAGCT CCGGGTCCTG 1860 

GCCCACCTCT CCGGGGACGA GAACCTGATC CGGGTCTTCC AGGAGGGGAG GGACATCCAC 1920 

ACCCAGACCG CCAGCTGGAT GTTCGGCGTC CCCCCGGAGG CCGTGGACCC CCTGATGCGC 1980 

CGGGCGGCCA AGACCATCAA CTTCGGGGTC CTCTACGGCA TGTCCGCCCA CCGCCTCTCC 2040

CAGGAGCTTG CCATCCCCTA CGAGGAGGCG GTGGCCTTCA TTGAGCGCTA CTTCCAGAGC 2100 

TTCCCCAAGG TGCGGGCCTG GATTGAGAAG ACCCTGGAGG AGGGCAGGAG GCGGGGGTAC 2160 

GTGGAGACCC TCTTCGGCCG CCGGCGCTAC GTGCCCGACC TCAACGCCCG GGTGAAGAGC 2220

GTGCGGGAGG CGGCGGAGCG CATGGCCTTC AACATGCCCG TCCAGGGCAC CGCCGCCGAC 2280

CTCATGAAGC TGGCCATGGT GAAGCTCTTC CCCCGGCTNC AGGAAATGGG GGCCAGGATG 2340

CTCCTNCAGG TCCACGACGA GCTGGTCCTC GAGGCCCCCA AAGAGCGGGC GGAGGNGGTG 2400 

GCCGCTTTGG CCAAGGAGGT CATGGAGGGG GTCTATCCCC TGGCCGTGCC CCTGGAGGTG 2460 

GAGGTGGGGA TGGGGGAGGA CTGGCTCTCC GCCAAGGAGT AG 2502 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Met Xaa Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu
1 5 10 15

Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly
20 25 30

Leu Thr Thr Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly Phe Ala
35 40 45

Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val Xaa Val
50 55 60

Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu Ala
65 70 75 80

Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gln Leu
85 90 95

Ala Leu Ile Lys Glu Leu Val Asp Leu Leu Gly Leu Xaa Arg Leu Glu
100 105 110

Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys
115 120 125

Ala Glu Lys Glu Gly Tyr Glu Val Arg Ile Leu Thr Ala Asp Arg Asp
130 135 140

Leu Tyr Gln Leu Leu Ser Asp Arg Ile Ala Val Leu His Pro Glu Gly
145 150 155 160

Tyr Leu Ile Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro
165 170 175

Glu Gln Trp Val Asp Tyr Arg Ala Leu Xaa Gly Asp Pro Ser Asp Asn
180 185 190

Leu Pro Gly Val Lys Gly Ile Gly Glu Lys Thr Ala Xaa Lys Leu Leu
195 200 205

Xaa Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu Asp Arg Val
210 215 220

Lys Pro Xaa Xaa Arg Glu Lys Ile Xaa Ala His Met Glu Asp Leu Xaa
225 230 235 240

Leu Ser Xaa Xaa Leu Ser Xaa Val Arg Thr Asp Leu Pro Leu Glu Val
245 250 255

Asp Phe Ala Xaa Arg Arg Glu Pro Asp Arg Glu Gly Leu Arg Ala Phe
260 265 270

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu
275 280 285

Glu Xaa Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly
290 295 300

Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu
305 310 315 320

Leu Leu Ala Leu Ala Ala Ala Arg Xaa Gly Arg Val His Arg Ala Xaa
325 330 335

Asp Pro Leu Xaa Gly Leu Arg Asp Leu Lys Glu Val Arg Gly Leu Leu
340 345 350

Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly Leu Asp Leu Xaa
355 360 365

Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn
370 375 380

Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu
385 390 395 400

Asp Ala Gly Glu Arg Ala Leu Leu Ser Glu Arg Leu Phe Xaa Asn Leu
405 410 415

Xaa Xaa Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Xaa Glu
420 425 430

Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly
435 440 445

Val Arg Leu Asp Val Ala Tyr Leu Gln Ala Leu Ser Leu Glu Val Ala
450 455 460

Glu Glu Ile Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His
465 470 475 480

Pro Phe Asn Leu Asn Ser Arg Asp Gln Leu Glu Arg Val Leu Phe Asp
485 490 495

Glu Leu Gly Leu Pro Ala Ile Gly Lys Thr Glu Lys Thr Gly Lys Arg
500 505 510

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro Ile
515 520 525

Val Glu Lys Ile Leu Gln Tyr Arg Glu Leu Thr Lys Leu Lys Asn Thr
530 535 540

Tyr Ile Asp Pro Leu Pro Xaa Leu Val His Pro Arg Thr Gly Arg Leu
545 550 555 560

His Thr
Ser Asp
Arg Ile



Leu Asp 
610 

Asp Glu 
625 

Gin Thr 



Arg Phe Asn 
565 

Pro Asn Leu 
580 

Arg Arg Ala 
595 

Tyr Ser Gin 



Asn Leu lie 



Leu Met 



Met Ser 



Ala Val 
690 

Ala Trp 
705 

Glu Thr 



Val Lys 
Val Gin 



Phe Pro 
770 

Asp Glu 
785 

Ala Leu 



Ala Ser Trp 
645 

Arg Arg Ala 
660 

Ala His Arg 
675 

Ala Phe He 



He Glu Lys 

Leu Phe Gly 
725 

Ser Val Arg 
740 

Gly Thr Ala 
755 

Arg Leu Xaa 
Leu Val Leu 



Leu Glu 
Xaa 



Ala Lys Glu 
805 

Val Glu Val 
820 



Gin Thr Ala 

Gin Asn He 

Phe Val Ala 
600 

He Glu Leu 
615 

Arg Val Phe 
630 

Met Phe Gly 

Ala Lys Thr 

Leu Ser Gin 
680 

Glu Arg Tyr 
695 

Thr Leu Glu 
710 

Arg Arg Arg 

Glu Ala Ala 

Ala Asp Leu 
760 

Glu Met Gly 
775 

Glu Ala Pro 
790 

Val Met Glu 
Gly Xaa Gly 



Thr Ala Thr Gly 
570 

Pro Val Arg Thr 
585 

Glu Glu Gly Trp 



Arg Val Leu Ala 
620 

Gin Glu Gly Arg 
635 

Val Pro Pro Glu 
650 

He Asn Phe Gly 
665 

Glu Leu Ala He 



Phe Gin Ser Phe 
700 

Glu Gly Arg Arg 
715 

Tyr Val Pro Asp 
730 

Glu Arg Met Ala 
745 

Met Lys Leu Ala 



Arg Leu Ser Ser 
575 

Pro Leu Gly Gin 
590 

Xaa Leu Val Ala 
605 

His Leu Ser Gly 



Asp He His Thr 
640 

Ala Val Asp Pro 
655 

Val Leu Tyr Gly 
670 

Pro Tyr Glu Glu 
685 

Pro Lys Val Arg 



Ala Arg Met Leu 
780 

Lys Xaa Arg Ala 
795 

Gly Val Tyr Pro 
810 

Glu Asp Trp Leu 
825 



Arg Gly Tyr Val 
720 

Leu Asn Ala Arg 
735 

Phe Asn Met Pro 
750 

Met Val Lys Leu 
765 

Leu Gin Val His 



Glu Xaa Val Ala 
800 

Leu Ala Val Pro 
815 

Ser Ala Lys Glu 
830 
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(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1647 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

ATGAATTCGG GGATGCTGCC CCTCTTTGAG CCCAAGGGCC GGGTCCTCCT GGTGGACGGC 60

CACCACCTGG CCTACCGCAC CTTCCACGCC CTGAAGGGCC TCACCACCAG CCGGGGGGAG 120

CCGGTGCAGG CGGTCTACGG CTTCGCCAAG AGCCTCCTCA AGGCCCTCAA GGAGGACGGG 180

GACGCGGTGA TCGTGGTCTT TGACGCCAAG GCCCCCTCCT TCCGCCACGA GGCCTACGGG 240

GGGTACAAGG CGGGCCGGGC CCCCACGCCG GAGGACTTTC CCCGGCAACT CGCCCTCATC 300

AAGGAGCTGG TGGACCTCCT GGGGCTGGCG CGCCTCGAGG TCCCGGGCTA CGAGGCGGAC 360

GACGTCCTGG CCAGCCTGGC CAAGAAGGCG GAAAAGGAGG GCTACGAGGT CCGCATCCTC 420

ACCGCCGACA AAGACCTTTA CCAGCTCCTT TCCGACCGCA TCCACGTCCT CCACCCCGAG 480

GGGTACCTCA TCACCCCGGC CTGGCTTTGG GAAAAGTACG GCCTGAGGCC CGACCAGTGG 540

GCCGACTACC GGGCCCTGAC CGGGGACGAG TCCGACAACC TTCCCGGGGT CAAGGGCATC 600

GGGGAGAAGA CGGCGAGGAA GCTTCTGGAG GAGTGGGGGA GCCTGGAAGC CCTCCTCAAG 660

AACCTGGACC GGCTGAAGCC CGCCATCCGG GAGAAGATCC TGGCCCACAT GGACGATCTG 720

AAGCTCTCCT GGGACCTGGC CAAGGTGCGC ACCGACCTGC CCCTGGAGGT GGACTTCGCC 780

AAAAGGCGGG AGCCCGACCG GGAGAGGCTT AGGGCCTTTC TGGAGAGGCT TGAGTTTGGC 840

AGCCTCCTCC ACGAGTTCGG CCTTCTGGAA AGCCCCAAGG CCCTGGAGGA GGCCCCCTGG 900

CCCCCGCCGG AAGGGGCCTT CGTGGGCTTT GTGCTTTCCC GCAAGGAGCC CATGTGGGCC 960

GATCTTCTGG CCCTGGCCGC CGCCAGGGGG GGCCGGGTCC ACCGGGCCCC CGAGCCTTAT 1020

AAAGCCCTCA GGGACCTGAA GGAGGCGCGG GGGCTTCTCG CCAAAGACCT GAGCGTTCTG 1080

GCCCTGAGGG AAGGCCTTGG CCTCCCGCCC GGCGACGACC CCATGCTCCT CGCCTACCTC 1140

CTGGACCCTT CCAACACCAC CCCCGAGGGG GTGGCCCGGC GCTACGGCGG GGAGTGGACG 1200

GAGGAGGCGG GGGAGCGGGC CGCCCTTTCC GAGAGGCTCT TCGCCAACCT GTGGGGGAGG 1260

CTTGAGGGGG AGGAGAGGCT CCTTTGGCTT TACCGGGAGG TGGAGAGGCC CCTTTCCGCT 1320

GTCCTGGCCC ACATGGAGGC CACGGGGGTG CGCCTGGACG TGGCCTATCT CAGGGCCTTG 1380

TCCCTGGAGG TGGCCGGGGA GATCGCCCGC CTCGAGGCCG AGGTCTTCCG CCTGGCCGGC 1440

CACCCCTTCA ACCTCAACTC CCGGGACCAG CTGGAAAGGG TCCTCTTTGA CGAGCTAGGG 1500

CTTCCCGCCA TCGGCAAGAC GGAGAAGACC GGCAAGCGCT CCACCAGCGC CGCCGTCCTG 1560 

GAGGCCCTCC GCGAGGCCCA CCCCATCGTG GAGAAGATCC TGCAGGCATG CAAGCTTGGC 1620 

ACTGGCCGTC GTTTTACAAC GTCGTGA 1647 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2088 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

ATGAATTCGG GGATGCTGCC CCTCTTTGAG CCCAAGGGCC GGGTCCTCCT GGTGGACGGC 60

CACCACCTGG CCTACCGCAC CTTCCACGCC CTGAAGGGCC TCACCACCAG CCGGGGGGAG 120

CCGGTGCAGG CGGTCTACGG CTTCGCCAAG AGCCTCCTCA AGGCCCTCAA GGAGGACGGG 180

GACGCGGTGA TCGTGGTCTT TGACGCCAAG GCCCCCTCCT TCCGCCACGA GGCCTACGGG 240

GGGTACAAGG CGGGCCGGGC CCCCACGCCG GAGGACTTTC CCCGGCAACT CGCCCTCATC 300

AAGGAGCTGG TGGACCTCCT GGGGCTGGCG CGCCTCGAGG TCCCGGGCTA CGAGGCGGAC 360

GACGTCCTGG CCAGCCTGGC CAAGAAGGCG GAAAAGGAGG GCTACGAGGT CCGCATCCTC 420

ACCGCCGACA AAGACCTTTA CCAGCTCCTT TCCGACCGCA TCCACGTCCT CCACCCCGAG 480

GGGTACCTCA TCACCCCGGC CTGGCTTTGG GAAAAGTACG GCCTGAGGCC CGACCAGTGG 540

GCCGACTACC GGGCCCTGAC CGGGGACGAG TCCGACAACC TTCCCGGGGT CAAGGGCATC 600

GGGGAGAAGA CGGCGAGGAA GCTTCTGGAG GAGTGGGGGA GCCTGGAAGC CCTCCTCAAG 660

AACCTGGACC GGCTGAAGCC CGCCATCCGG GAGAAGATCC TGGCCCACAT GGACGATCTG 720

AAGCTCTCCT GGGACCTGGC CAAGGTGCGC ACCGACCTGC CCCTGGAGGT GGACTTCGCC 780

AAAAGGCGGG AGCCCGACCG GGAGAGGCTT AGGGCCTTTC TGGAGAGGCT TGAGTTTGGC 840

AGCCTCCTCC ACGAGTTCGG CCTTCTGGAA AGCCCCAAGG CCCTGGAGGA GGCCCCCTGG 900

CCCCCGCCGG AAGGGGCCTT CGTGGGCTTT GTGCTTTCCC GCAAGGAGCC CATGTGGGCC 960

GATCTTCTGG CCCTGGCCGC CGCCAGGGGG GGCCGGGTCC ACCGGGCCCC CGAGCCTTAT 1020

AAAGCCCTCA GGGACCTGAA GGAGGCGCGG GGGCTTCTCG CCAAAGACCT GAGCGTTCTG 1080

GCCCTGAGGG AAGGCCTTGG CCTCCCGCCC GGCGACGACC CCATGCTCCT CGCCTACCTC 1140

CTGGACCCTT CCAACACCAC CCCCGAGGGG GTGGCCCGGC GCTACGGCGG GGAGTGGACG 1200

GAGGAGGCGG GGGAGCGGGC CGCCCTTTCC GAGAGGCTCT TCGCCAACCT GTGGGGGAGG 1260

CTTGAGGGGG AGGAGAGGCT CCTTTGGCTT TACCGGGAGG TGGAGAGGCC CCTTTCCGCT 1320

GTCCTGGCCC ACATGGAGGC CACGGGGGTG CGCCTGGACG TGGCCTATCT CAGGGCCTTG 1380

TCCCTGGAGG TGGCCGGGGA GATCGCCCGC CTCGAGGCCG AGGTCTTCCG CCTGGCCGGC 1440 

CACCCCTTCA ACCTCAACTC CCGGGACCAG CTGGAAAGGG TCCTCTTTGA CGAGCTAGGG 1500 

CTTCCCGCCA TCGGCAAGAC GGAGAAGACC GGCAAGCGCT CCACCAGCGC CGCCGTCCTG 1560 

GAGGCCCTCC GCGAGGCCCA CCCCATCGTG GAGAAGATCC TGCAGTACCG GGAGCTCACC 1620 

AAGCTGAAGA GCACCTACAT TGACCCCTTG CCGGACCTCA TCCACCCCAG GACGGGCCGC 1680 

CTCCACACCC GCTTCAACCA GACGGCCACG GCCACGGGCA GGCTAAGTAG CTCCGATCCC 1740 

AACCTCCAGA ACATCCCCGT CCGCACCCCG CTTGGGCAGA GGATCCGCCG GGCCTTCATC 1800 

GCCGAGGAGG GGTGGCTATT GGTGGCCCTG GACTATAGCC AGATAGAGCT CAGGGTGCTG 1860 

GCCCACCTCT CCGGCGACGA GAACCTGATC CGGGTCTTCC AGGAGGGGCG GGACATCCAC 1920 

ACGGAGACCG CCAGCTGGAT GTTCGGCGTC CCCCGGGAGG CCGTGGACCC CCTGATGCGC 1980 

CGGGCGGCCA AGACCATCAA CTTCGGGGTC CTCTACGGCA TGTCGGCCCA CCGCCTCTCC 2040

CAGGAGCTAG CTAGCCATCC CTTACGAGGA GGCCCAGGCC TTCATTGA 2088
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 962 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

ATGAATTCGG GGATGCTGCC CCTCTTTGAG CCCAAGGGCC GGGTCCTCCT GGTGGACGGC 60

CACCACCTGG CCTACCGCAC CTTCCACGCC CTGAAGGGCC TCACCACCAG CCGGGGGGAG 120

CCGGTGCAGG CGGTCTACGG CTTCGCCAAG AGCCTCCTCA AGGCCCTCAA GGAGGACGGG 180

GACGCGGTGA TCGTGGTCTT TGACGCCAAG GCCCCCTCCT TCCGCCACGA GGCCTACGGG 240

GGGTACAAGG CGGGCCGGGC CCCCACGCCG GAGGACTTTC CCCGGCAACT CGCCCTCATC 300

AAGGAGCTGG TGGACCTCCT GGGGCTGGCG CGCCTCGAGG TCCCGGGCTA CGAGGCGGAC 360

GACGTCCTGG CCAGCCTGGC CAAGAAGGCG GAAAAGGAGG GCTACGAGGT CCGCATCCTC 420

ACCGCCGACA AAGACCTTTA CCAGCTTCTT TCCGACCGCA TCCACGTCCT CCACCCCGAG 480

GGGTACCTCA TCACCCCGGC CTGGCTTTGG GAAAAGTACG GCCTGAGGCC CGACCAGTGG 540

GCCGACTACC GGGCCCTGAC CGGGGACGAG TCCGACAACC TTCCCGGGGT CAAGGGCATC 600

GGGGAGAAGA CGGCGAGGAA GCTTCTGGAG GAGTGGGGGA GCCTGGAAGC CCTCCTCAAG 660

AACCTGGACC GGCTGAAGCC CGCCATCCGG GAGAAGATCC TGGCCCACAT GGACGATCTG 720

AAGCTCTCCT GGGACCTGGC CAAGGTGCGC ACCGACCTGC CCCTGGAGGT GGACTTCGCC 780

AAAAGGCGGG AGCCCGACCG GGAGAGGCTT AGGGCCTTTC TGGAGAGGCT TGAGTTTGGC 840

AGCCTCCTCC ACGAGTTCGG CCTTCTGGAA AGCCCCAAGT CATGGAGGGG GTGTATCCCC 900

TGGCCGTGCC CCTGGAGGTG GAGGTGGGGA TAGGGGAGGA CTGGCTCTCC GCCAAGGAGT 960

GA 962
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1600 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:

ATGGAATTCG GGGATGCTGC CCCTCTTTGA GCCCAAGGGC CGGGTCCTCC TGGTGGACGG 60

CCACCACCTG GCCTACCGCA CCTTCCACGC CCTGAAGGGC CTCACCACCA GCCGGGGGGA 120

GCCGGTGCAG GCGGTCTACG GCTTCGCCAA GAGCCTCCTC AAGGCCCTCA AGGAGGACGG 180

GGACGCGGTG ATCGTGGTCT TTGACGCCAA GGCCCCCTCC TTCCGCCACG AGGCCTACGG 240

GGGGTACAAG GCGGGCCGGG CCCCCACGCC GGAGGACTTT CCCCGGCAAC TCGCCCTCAT 300

CAAGGAGCTG GTGGACCTCC TGGGGCTGGC GCGCCTCGAG GTCCCGGGCT ACGAGGCGGA 360

CGACGTCCTG GCCAGCCTGG CCAAGAAGGC GGAAAAGGAG GGCTACGAGG TCCGCATCCT 420

CACCGCCGAC AAAGACCTTT ACCAGCTCCT TTCCGACCGC ATCCACGTCC TCCACCCCGA 480

GGGGTACCTC ATCACCCCGG CCTGGCTTTG GGAAAAGTAC GGCCTGAGGC CCGACCAGTG 540

GGCCGACTAC CGGGCCCTGA CCGGGGACGA GTCCGACAAC CTTCCCGGGG TCAAGGGCAT 600

CGGGGAGAAG ACGGCGAGGA AGCTTCTGGA GGAGTGGGGG AGCCTGGAAG CCCTCCTCAA 660

GAACCTGGAC CGGCTGAAGC CCGCCATCCG GGAGAAGATC CTGGCCCACA TGGACGATCT 720

GAAGCTCTCC TGGGACCTGG CCAAGGTGCG CACCGACCTG CCCCTGGAGG TGGACTTCGC 780

CAAAAGGCGG GAGCCCGACC GGGAGAGGCT TAGGGCCTTT CTGGAGAGGC TTGAGTTTGG 840

CAGCCTCCTC CACGAGTTCG GCCTTCTGGA AAGCCCCAAG ATCCGCCGGG CCTTCATCGC 900

CGAGGAGGGG TGGCTATTGG TGGCCCTGGA CTATAGCCAG ATAGAGCTCA GGGTGCTGGC 960

CCACCTCTCC GGCGACGAGA ACCTGATCCG GGTCTTCCAG GAGGGGCGGG ACATCCACAC 1020

GGAGACCGCC AGCTGGATGT TCGGCGTCCC CCGGGAGGCC GTGGACCCCC TGATGCGCCG 1080

GGCGGCCAAG ACCATCAACT TCGGGGTCCT CTACGGCATG TCGGCCCACC GCCTCTCCCA 1140

GGAGCTAGCC ATCCCTTACG AGGAGGCCCA GGCCTTCATT GAGCGCTACT TTCAGAGCTT 1200

CCCCAAGGTG CGGGCCTGGA TTGAGAAGAC CCTGGAGGAG GGCAGGAGGC GGGGGTACGT 1260

GGAGACCCTC TTCGGCCGCC GCCGCTACGT GCCAGACCTA GAGGCCCGGG TGAAGAGCGT 1320

GCGGGAGGCG GCCGAGCGCA TGGCCTTCAA CATGCCCGTC CGGGGCACCG CCGCCGACCT 1380

CATGAAGCTG GCTATGGTGA AGCTCTTCCC CAGGCTGGAG GAAATGGGGG CCAGGATGCT 1440

CCTTCAGGTC CACGACGAGC TGGTCCTCGA GGCCCCAAAA GAGAGGGCGG AGGCCGTGGC 1500

CCGGCTGGCC AAGGAGGTCA TGGAGGGGGT GTATCCCCTG GCCGTGCCCC TGGAGGTGGA 1560

GGTGGGGATA GGGGAGGACT GGCTCTCCGC CAAGGAGTGA 1600
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 
CACGAATTCG GGGATGCTGC CCCTCTTTGA GCCCAA 36

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 
GTGAGATCTA TCACTCCTTG GCGGAGAGCC AGTC 34 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TAATACGACT CACTATAGGG AGACCGGAAT TCGAGCTCGC CCGGGCGAGC TCGAATTCCG 60 
TGTATTCTAT AGTGTCACCT AAATCGAATT C 91
(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

TAATACGACT CACTATAGGG 20

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 

GAATTCGATT TAGGTGACAC TATAGAA 27

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

GTAATCATGG TCATAGCTGG TAGCTTGCTA C 31

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GGATCCTCTA GAGTCGACCT GCAGGCATGC CTACCTTGGT AG 42
(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
GGATCCTCTA GAGTCGACCT GCAGGCATGC 30

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2502 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

ATGAATTCGG GGATGCTGCC CCTCTTTGAG CCCAAGGGCC GGGTCCTCCT GGTGGACGGC 60

CACCACCTGG CCTACCGCAC CTTCCACGCC CTGAAGGGCC TCACCACCAG CCGGGGGGAG 120

CCGGTGCAGG CGGTCTACGG CTTCGCCAAG AGCCTCCTCA AGGCCCTCAA GGAGGACGGG 180

GACGCGGTGA TCGTGGTCTT TGACGCCAAG GCCCCCTCCT TCCGCCACGA GGCCTACGGG 240

GGGTACAAGG CGGGCCGGGC CCCCACGCCG GAGGACTTTC CCCGGCAACT CGCCCTCATC 300

AAGGAGCTGG TGGACCTCCT GGGGCTGGCG CGCCTCGAGG TCCCGGGCTA CGAGGCGGAC 360

GACGTCCTGG CCAGCCTGGC CAAGAAGGCG GAAAAGGAGG GCTACGAGGT CCGCATCCTC 420

ACCGCCGACA AAGACCTTTA CCAGCTCCTT TCCGACCGCA TCCACGTCCT CCACCCCGAG 480

GGGTACCTCA TCACCCCGGC CTGGCTTTGG GAAAAGTACG GCCTGAGGCC CGACCAGTGG 540

GCCGACTACC GGGCCCTGAC CGGGGACGAG TCCGACAACC TTCCCGGGGT CAAGGGCATC 600

GGGGAGAAGA CGGCGAGGAA GCTTCTGGAG GAGTGGGGGA GCCTGGAAGC CCTCCTCAAG 660

AACCTGGACC GGCTGAAGCC CGCCATCCGG GAGAAGATCC TGGCCCACAT GGACGATCTG 720

AAGCTCTCCT GGGACCTGGC CAAGGTGCGC ACCGACCTGC CCCTGGAGGT GGACTTCGCC 780

AAAAGGCGGG AGCCCGACCG GGAGAGGCTT AGGGCCTTTC TGGAGAGGCT TGAGTTTGGC 840

AGCCTCCTCC ACGAGTTCGG CCTTCTGGAA AGCCCCAAGG CCCTGGAGGA GGCCCCCTGG 900

CCCCCGCCGG AAGGGGCCTT CGTGGGCTTT GTGCTTTCCC GCAAGGAGCC CATGTGGGCC 960

GATCTTCTGG CCCTGGCCGC CGCCAGGGGG GGCCGGGTCC ACCGGGCCCC CGAGCCTTAT 1020

AAAGCCCTCA GGGACCTGAA GGAGGCGCGG GGGCTTCTCG CCAAAGACCT GAGCGTTCTG 1080

GCCCTGAGGG AAGGCCTTGG CCTCCCGCCC GGCGACGACC CCATGCTCCT CGCCTACCTC 1140

CTGGACCCTT CCAACACCAC CCCCGAGGGG GTGGCCCGGC GCTACGGCGG GGAGTGGACG 1200 

GAGGAGGCGG GGGAGCGGGC CGCCCTTTCC GAGAGGCTCT TCGCCAACCT GTGGGGGAGG 1260 

CTTGAGGGGG AGGAGAGGCT CCTTTGGCTT TACCGGGAGG TGGAGAGGCC CCTTTCCGCT 1320 

GTCCTGGCCC ACATGGAGGC CACGGGGGTG CGCCTGGACG TGGCCTATCT CAGGGCCTTG 1380

TCCCTGGAGG TGGCCGGGGA GATCGCCCGC CTCGAGGCCG AGGTCTTCCG CCTGGCCGGC 1440 

CACCCCTTCA ACCTCAACTC CCGGGACCAG CTGGAAAGGG TCCTCTTTGA CGAGCTAGGG 1500 

CTTCCCGCCA TCGGCAAGAC GGAGAAGACC GGCAAGCGCT CCACCAGCGC CGCCGTCCTG 1560 

GAGGCCCTCC GCGAGGCCCA CCCCATCGTG GAGAAGATCC TGCAGTACCG GGAGCTCACC 1620 

AAGCTGAAGA GCACCTACAT TGACCCCTTG CCGGACCTCA TCCACCCCAG GACGGGCCGC 1680 

CTCCACACCC GCTTCAACCA GACGGCCACG GCCACGGGCA GGCTAAGTAG CTCCGATCCC 1740 

AACCTCCAGA ACATCCCCGT CCGCACCCCG CTTGGGCAGA GGATCCGCCG GGCCTTCATC 1800 

GCCGAGGAGG GGTGGCTATT GGTGGCCCTG GACTATAGCC AGATAGAGCT CAGGGTGCTG 1860 

GCCCACCTCT CCGGCGACGA GAACCTGATC CGGGTCTTCC AGGAGGGGCG GGACATCCAC 1920 

ACGGAGACCG CCAGCTGGAT GTTCGGCGTC CCCCGGGAGG CCGTGGACCC CCTGATGCGC 1980 

CGGGCGGCCA AGACCATCAA CTTCGGGGTC CTCTACGGCA TGTCGGCCCA CCGCCTCTCC 2040

CAGGAGCTAG CCATCCCTTA CGAGGAGGCC CAGGCCTTCA TTGAGCGCTA CTTTCAGAGC 2100 

TTCCCCAAGG TGCGGGCCTG GATTGAGAAG ACCCTGGAGG AGGGCAGGAG GCGGGGGTAC 2160 

GTGGAGACCC TCTTCGGCCG CCGCCGCTAC GTGCCAGACC TAGAGGCCCG GGTGAAGAGC 2220 

GTGCGGGAGG CGGCCGAGCG CATGGCCTTC AACATGCCCG TCCGGGGCAC CGCCGCCGAC 2280

CTCATGAAGC TGGCTATGGT GAAGCTCTTC CCCAGGCTGG AGGAAATGGG GGCCAGGATG 2340 

CTCCTTCAGG TCCACGACGA GCTGGTCCTC GAGGCCCCAA AAGAGAGGGC GGAGGCCGTG 2400

GCCCGGCTGG CCAAGGAGGT CATGGAGGGG GTGTATCCCC TGGCCGTGCC CCTGGAGGTG 2460

GAGGTGGGGA TAGGGGAGGA CTGGCTCTCC GCCAAGGAGT GA 2502 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

GATTTAGGTG ACACTATAG 19
(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CGGACGAACA AGCGAGACAG CGACACAGGT ACCACATGGT ACAAGAGGCA AGAGAGACGA 60 
CACAGCAGAA AC 72 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
GTTTCTGCTG TGTCGTCTCT CTTGCCTCTT GTACCATGTG GTACCTGTGT CGCTGTCTCG 60 
CTTGTTCGTC 70 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
GACGAACAAG CGAGACAGCG 20 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:
GTTTCTGCTG TGTCGTCTCT CTTG 24
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
CCTCTTGTAC CATGTGGTAC CTGTGTCGCT GTCTCGCTTG TTCGTC 46 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
ACACAGGTAC CACATGGTAC AAGAGGCAAG AGAGACGACA CAGCAGAAAC 50 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg Ile Asn Ser
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 969 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

ATGGCTAGCA TGACTGGTGG ACAGCAAATG GGTCGGATCA ATTCGGGGAT GCTGCCCCTC 60

TTTGAGCCCA AGGGCCGGGT CCTCCTGGTG GACGGCCACC ACCTGGCCTA CCGCACCTTC 120

CACGCCCTGA AGGGCCTCAC CACCAGCCGG GGGGAGCCGG TGCAGGCGGT CTACGGCTTC 180

GCCAAGAGCC TCCTCAAGGC CCTCAAGGAG GACGGGGACG CGGTGATCGT GGTCTTTGAC 240

GCCAAGGCCC CCTCCTTCCG CCACGAGGCC TACGGGGGGT ACAAGGCGGG CCGGGCCCCC 300

ACGCCGGAGG ACTTTCCCCG GCAACTCGCC CTCATCAAGG AGCTGGTGGA CCTCCTGGGG 360

CTGGCGCGCC TCGAGGTCCC GGGCTACGAG GCGGACGACG TCCTGGCCAG CCTGGCCAAG 420

AAGGCGGAAA AGGAGGGCTA CGAGGTCCGC ATCCTCACCG CCGACAAAGA CCTTTACCAG 480

CTTCTTTCCG ACCGCATCCA CGTCCTCCAC CCCGAGGGGT ACCTCATCAC CCCGGCCTGG 540

CTTTGGGAAA AGTACGGCCT GAGGCCCGAC CAGTGGGCCG ACTACCGGGC CCTGACCGGG 600

GACGAGTCCG ACAACCTTCC CGGGGTCAAG GGCATCGGGG AGAAGACGGC GAGGAAGCTT 660

CTGGAGGAGT GGGGGAGCCT GGAAGCCCTC CTCAAGAACC TGGACCGGCT GAAGCCCGCC 720

ATCCGGGAGA AGATCCTGGC CCACATGGAC GATCTGAAGC TCTCCTGGGA CCTGGCCAAG 780

GTGCGCACCG ACCTGCCCCT GGAGGTGGAC TTCGCCAAAA GGCGGGAGCC CGACCGGGAG 840

AGGCTTAGGG CCTTTCTGGA GAGGCTTGAG TTTGGCAGCC TCCTCCACGA GTTCGGCCTT 900

CTGGAAAGCC CCAAGTCATG GAGGGGGTGT ATCCCCTGGC CGTGCCCCTG GAGGTGGAGG 960

TGGGGATAG 969
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 948 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31:

ATGGCTAGCA TGACTGGTGG ACAGCAAATG GGTCGGATCA ATTCGGGGAT GCTGCCCCTC 60

TTTGAGCCCA AGGGCCGGGT CCTCCTGGTG GACGGCCACC ACCTGGCCTA CCGCACCTTC 120

CACGCCCTGA AGGGCCTCAC CACCAGCCGG GGGGAGCCGG TGCAGGCGGT CTACGGCTTC 180

GCCAAGAGCC TCCTCAAGGC CCTCAAGGAG GACGGGGACG CGGTGATCGT GGTCTTTGAC 240

GCCAAGGCCC CCTCCTTCCG CCACGAGGCC TACGGGGGGT ACAAGGCGGG CCGGGCCCCC 300

ACGCCGGAGG ACTTTCCCCG GCAACTCGCC CTCATCAAGG AGCTGGTGGA CCTCCTGGGG 360

CTGGCGCGCC TCGAGGTCCC GGGCTACGAG GCGGACGACG TCCTGGCCAG CCTGGCCAAG 420

AAGGCGGAAA AGGAGGGCTA CGAGGTCCGC ATCCTCACCG CCGACAAAGA CCTTTACCAG 480

CTTCTTTCCG ACCGCATCCA CGTCCTCCAC CCCGAGGGGT ACCTCATCAC CCCGGCCTGG 540

CTTTGGGAAA AGTACGGCCT GAGGCCCGAC CAGTGGGCCG ACTACCGGGC CCTGACCGGG 600

GACGAGTCCG ACAACCTTCC CGGGGTCAAG GGCATCGGGG AGAAGACGGC GAGGAAGCTT 660

CTGGAGGAGT GGGGGAGCCT GGAAGCCCTC CTCAAGAACC TGGACCGGCT GAAGCCCGCC 720

ATCCGGGAGA AGATCCTGGC CCACATGGAC GATCTGAAGC TCTCCTGGGA CCTGGCCAAG 780

GTGCGCACCG ACCTGCCCCT GGAGGTGGAC TTCGCCAAAA GGCGGGAGCC CGACCGGGAG 840

AGGCTTAGGG CCTTTCTGGA GAGGCTTGAG TTTGGCAGCC TCCTCCACGA GTTCGGCCTT 900

CTGGAAAGCC CCAAGGCCGC ACTCGAGCAC CACCACCACC ACCACTGA 948
(2) INFORMATION FOR SEQ ID NO:32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 206 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CGCCAGGGTT TTCCCAGTCA CGACGTTGTA AAACGACGGC CAGTGAATTG TAATACGACT 60 
CACTATAGGG CGAATTCGAG CTCGGTACCC GGGGATCCTC TAGAGTCGAC CTGCAGGCAT 120 
GCAAGCTTGA GTATTCTATA GTGTCACCTA AATAGCTTGG CGTAATCATG GTCATAGCTG 180 
TTTCCTGTGT GAAATTGTTA TCCGCT 206 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
TTCTGGGTTC TCTGCTCTCT GGTCGCTGTC TCGCTTGTTC GTC 43 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:
GCTGTCTCGC TTGTTCGTC 19
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

GACGAACAAG CGAGACAGCG 20

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 

TTCTGGGTTC TCTGCTCTCT GGTC 24

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 

GACGAACAAG CGAGACAGCG ACCAGAGAGC AGAGAACCCA GAA 43

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

ACCAGAGAGC AGAGAACCCA GAA 23

(2) INFORMATION FOR SEQ ID NO: 39:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
AACAGCTATG ACCATGATTA C 21 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 
CACCGTCCTC TTCAAGAAGT TTATCCAGAA GCCAATGCAC CCATTGGACA TAACCGGGAA 60 
TCCTACATGG TTCCTTTTAT ACCACTGTAC AGAAATGGTG ATTTCTTTAT TTCATCCAAA 120

GATCTGGGCT ATGACTATAG CTATCTACAA GATTCAG 157 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
CACCGTCCTC TTCAAGAAGT TTATCCAGAA GCCAATGCAC CCATTAGACA TAACCGGGAA 60 
TCCTACATGG TTCCTTTTAT ACCACTGTAC AGAAATGGTG ATTTCTTTAT TTCATCCAAA 120

GATCTGGGCT ATGACTATAG CTATCTACAA GATTCAG 157 
(2) INFORMATION FOR SEQ ID NO:42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 
CACCGTCCTC TTCAAGAAG 19
(2) INFORMATION FOR SEQ ID NO: 43:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 

CTGAATCTTG TAGATAGCTA 20

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 339 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 

GCCTTATTTT ACTTTAAAAA TTTTCAAATG TTTCTTTTAT ACACAATATG TTTCTTAGTC 60 

TGAATAACCT TTTCCTCTGC AGTATTTTTG AGCAGTGGCT CCGAAGGCAC CGTCCTCTTC 120

AAGAAGTTTA TCCAGAAGCC AATGCACCCA TTAGACATAA CCGGGAATCC TACATGGTTC 180

CTTTTATACC ACTGTACAGA AATGGTGATT TCTTTATTTC ATCCAAAGAT CTGGGCTATG 240

ACTATAGCTA TCTACAAGAT TCAGGTAAAG TTTACTTTCT TTCAGAGGAA TTGCTGAATC 300

TAGTGTTACC AATTTATTTT GAGATAACAC AAAACTTTA 339 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
GCCTTATTTT ACTTTAAAAA T 21 
(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46:
TAAAGTTTTG TGTTATCTCA 20
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

CACCGTCCTC TTCAAGAAGT TTATCCAGAA GCCAATGCAC CCATTGGACA TAACCGGGAA 60 

TCCTACATGG TTCCTTTTAT ACCACTGTAC AGAAATGGTG ATTTCTTTAT TTCATCCAAA 120

GATCTGGGCT ATGACTATAG CTATCTACAA GATTCAG 157 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 
CTGAATCTTG TAGATAGCTA TAGTCATAGC CCAGATCTTT GGATGAAATA AAGAAATCAC 60 
CATTTCTGTA CAGTGGTATA AAAGGAACCA TGTAGGATTC CCGGTTATGT CCAATGGGTG 120

CATTGGCTTC TGGATAAACT TCTTGAAGAG GACGGTG 157 
(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 165 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
AGCGGATAAC AATTTCACAC AGGAAACAGC TATGACCATG ATTACGCCAA GCTATTTAGG 60 
TGACACTATA GAATACTCAA GCTTGCATGC CTGCAGGTCG ACTCTAGAGG ATCCCCGGGT 120

ACCGAGCTCG AATTCGCCCT ATAGTGAGTC GTATTAGGAT CCGTG 165 
(2) INFORMATION FOR SEQ ID NO: 50:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 206 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
CGCCAGGGTT TTCCCAGTCA CGACGTTGTA AAACGACGGC CAGTGAATTG TAATACGACT 60 
CACTATAGGG CGAATTCGAG CTCGGTACCC GGGGATCCTC TAGAGTCGAC CTGCAGGCAT 120 
GCAAGCTTGA GTATTCTATA GTGTCACCTA AATAGCTTGG CGTAATCATG GTCATAGCTG 180 
TTTCCTGTGT GAAATTGTTA TCCGCT 206

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
AGCGGATAAC AATTTCACAC AGGA 24 
(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 
CACGGATCCT AATACGACTC ACTATAGGG 29 
(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 
CGCCAGGGTT TTCCCAGTCA CGAC 24 
(2) INFORMATION FOR SEQ ID NO: 54:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
CACCGTCCTC TTCAAGAAGT TTATCCAGAA GCCAATGCAC CCATTAGACA TAACCGGGAA 60 
TCCTACATGG TTCCTTTTAT ACCACTGTAC AGAAATGGTG ATTTCTTTAT TTCATCCAAA 120

GATCTGGGCT ATGACTATAG CTATCTACAA GATTCAG 157 
(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
CACCGTCCTC TTCAAGAAGT TTATCCAGAA GCCAATGCAC CCATTGGACA TAACCAGGAA 60 
TCCTACATGG TTCCTTTTAT ACCACTGTAC AGAAATGGTG ATTTCTTTAT TTCATCCAAA 120 
GATCTGGGCT ATGACTATAG CTATCTACAA GATTCAG 157 
(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

CACCGTCCTC TTCAAGAAGT TTATCCAGAA GCCAATGCAC CCATTGGACA TAACCGGGAA 60 

TCCTACATGG TTCCTTTTAT ACCACTGTAC AGAAATGGTG ATTTCTTTAT TTCATCCAAA 120

GATCTGGGCT ATGACTATAG CTATCTACAA GATTCAGACC CAGACTCTTT TCAAGACTAC 180 

ATTAAGTCCT ATTTGGAACA AGCGAGTCGG ATCTGGTCAT GGCTCCTTGG GGCGGCGATG 240

GTAGGGGCCG TCCTCACTGC CCTGCTGGCA GGGCTTGTGA GCTTGCTGTG TCGTCACAAG 300 

AGAAAGCAGC TTCCTGAAGA AAAGCAGCCA CTCCTCATGG AGAAAGAGGA TTACCACAGC 360 

TTGTATCAGA GCCATTTA 378 
(2) INFORMATION FOR SEQ ID NO: 57:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57: 

CACCGTCCTC TTCAAGAAGT TTATCCAGAA GCCAATGCAC CCATTGGACA TAACCAGGAA 60 

TCCTACATGG TTCCTTTTAT ACCACTGTAC AGAAATGGTG ATTTCTTTAT TTCATCCAAA 120

GATCTGGGCT ATGACTATAG CTATCTACAA GATTCAGACC CAGACTCTTT TCAAGACTAC 180 

ATTAAGTCCT ATTTGGAACA AGCGAGTCGG ATCTGGTCAT GGCTCCTTGG GGCGGCGATG 240

GTAGGGGCCG TCCTCACTGC CCTGCTGGCA GGGCTTGTGA GCTTGCTGTG TCGTCACAAG 300

AGAAAGCAGC TTCCTGAAGA AAAGCAGCCA CTCCTCATGG AGAAAGAGGA TTACCACAGC 360 

TTGTATCAGA GCCATTTA 378 
(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1059 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58:

GCAAGTTTGG CTTTTGGGGA CCAAACTGCA CAGAGAGACG ACTCTTGGTG AGAAGAAACA 60

TCTTCGATTT GAGTGCCCCA GAGAAGGACA AATTTTTTGC CTACCTCACT TTAGCAAAGC 120

ATACCATCAG CTCAGACTAT GTCATCCCCA TAGGGACCTA TGGCCAAATG AAAAATGGAT 180

CAACACCCAT GTTTAACGAC ATCAATATTT ATGACCTCTT TGTCTGGATG CATTATTATG 240

TGTCAATGGA TGCACTGCTT GGGGGATATG AAATCTGGAG AGACATTGAT TTTGCCCATG 300

AAGCACCAGC TTTTCTGCCT TGGCATAGAC TCTTCTTGTT GCGGTGGGAA CAAGAAATCC 360

AGAAGCTGAC AGGAGATGAA AACTTCACTA TTCCATATTG GGACTGGCGG GATGCAGAAA 420

AGTGTGACAT TTGCACAGAT GAGTACATGG GAGGTCAGCA CCCCACAAAT CCTAACTTAC 480

TCAGCCCAGC ATCATTCTTC TCCTCTTGGC AGATTGTCTG TAGCCGATTG GAGGAGTACA 540

ACAGCCATCA GTCTTTATGC AATGGAACGC CCGAGGGACC TTTACGGCGT AATCCTGGAA 600

ACCATGACAA ATCCAGAACC CCAAGGCTCC CCTCTTCAGC TGATGTAGAA TTTTGCCTGA 660

GTTTGACCCA ATATGAATCT GGTTCCATGG ATAAAGCTGC CAATTTCAGC TTTAGAAATA 720

CACTGGAAGG ATTTGCTAGT CCACTTACTG GGATAGCGGA TGCCTCTCAA AGCAGCATGC 780

ACAATGCCTT GCACATCTAT ATGAATGGAA CAATGTCCCA GGTACAGGGA TCTGCCAACG 840 

ATCCTATCTT CCTTCTTCAC CATGCATTTG TTGACAGTAT TTTTGAGCAG TGGCTCCGAA 900 

GGCACCGTCC TCTTCAAGAA GTTTATCCAG AAGCCAATGC ACCCATTGGA CATAACCGGG 960 

AATCCTACAT GGTTCCTTTT ATACCACTGT ACAGAAATGG TGATTTCTTT ATTTCATCCA 1020 

AAGATCTGGG CTATGACTAT AGCTATCTAC AAGATTCAG 1059 
(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1059 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

GCAAGTTTGG CTTTTGGGGA CCAAACTGCA CAGAGAGACG ACTCTTGGTG AGAAGAAACA 60 

TCTTCGATTT GAGTGCCCCA GAGAAGGACA AATTTTTTGC CTACCTCACT TTAGCAAAGC 120 

ATACCATCAG CTCAGACTAT GTCATCCCCA TAGGGACCTA TGGCCAAATG AAAAATGGAT 180 

CAACACCCAT GTTTAACGAC ATCAATATTT ATGACCTCTT TGTCTGGATG CATTATTATG 240 

TGTCAATGGA TGCACTGCTT GGGGGATATG AAATCTGGAG AGACATTGAT TTTGCCCATG 300 

AAGCACCAGC TTTTCTGCCT TGGCATAGAC TCTTCTTGTT GCGGTGGGAA CAAGAAATCC 360 

AGAAGCTGAC AGGAGATGAA AACTTCACTA TTCCATATTG GGACTGGCGG GATGCAGAAA 420 

AGTGTGACAT TTGCACAGAT GAGTACATGG GAGGTCAGCA CCCCACAAAT CCTAACTTAC 480 

TCAGCCCAGC ATCATTCTTC TCCTCTTGGC AGATTGTCTG TAGCCGATTG GAGGAGTACA 540 

ACAGCCATCA GTCTTTATGC AATGGAACGC CCGAGGGACC TTTACGGCGT AATCCTGGAA 600 

ACCATGACAA ATCCAGAACC CCAAGGCTCC CCTCTTCAGC TGATGTAGAA TTTTGCCTGA 660 

GTTTGACCCA ATATGAATCT GGTTCCATGG ATAAAGCTGC CAATTTCAGC TTTAGAAATA 720 

CACTGGAAGG ATTTGCTAGT CCACTTACTG GGATAGCGGA TGCCTCTCAA AGCAGCATGC 780 

ACAATGCCTT GCACATCTAT ATGAATGGAA CAATGTCCCA GGTACAGGGA TCTGCCAACG 840 

ATCCTATCTT CCTTCTTCAC CATGCATTTG TTGACAGTAT TTTTGAGCAG TGGCTCCGAA 900 

GGCACCGTCC TCTTCAAGAA GTTTATCCAG AAGCCAATGC ACCCATTGGA CATAACCAGG 960 

AATCCTACAT GGTTCCTTTT ATACCACTGT ACAGAAATGG TGATTTCTTT ATTTCATCCA 1020 

AAGATCTGGG CTATGACTAT AGCTATCTAC AAGATTCAG 1059 
(2) INFORMATION FOR SEQ ID NO: 60:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1587 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60:

ATGCTCCTGG CTGTTTTGTA CTGCCTGCTG TGGAGTTTCC AGACCTCCGC TGGCCATTTC 60

CCTAGAGCCT GTGTCTCCTC TAAGAACCTG ATGGAGAAGG AATGCTGTCC ACCGTGGAGC 120

GGGGACAGGA GTCCCTGTGG CCAGCTTTCA GGCAGAGGTT CCTGTCAGAA TATCCTTCTG 180

TCCAATGCAC CACTTGGGCC TCAATTTCCC TTCACAGGGG TGGATGACCG GGAGTCGTGG 240

CCTTCCGTCT TTTATAATAG GACCTGCCAG TGCTCTGGCA ACTTCATGGG ATTCAACTGT 300

GGAAACTGCA AGTTTGGCTT TTGGGGACCA AACTGCACAG AGAGACGACT CTTGGTGAGA 360

AGAAACATCT TCGATTTGAG TGCCCCAGAG AAGGACAAAT TTTTTGCCTA CCTCACTTTA 420

GCAAAGCATA CCATCAGCTC AGACTATGTC ATCCCCATAG GGACCTATGG CCAAATGAAA 480

AATGGATCAA CACCCATGTT TAACGACATC AATATTTATG ACCTCTTTGT CTGGATGCAT 540

TATTATGTGT CAATGGATGC ACTGCTTGGG GGATATGAAA TCTGGAGAGA CATTGATTTT 600

GCCCATGAAG CACCAGCTTT TCTGCCTTGG CATAGACTCT TCTTGTTGCG GTGGGAACAA 660

GAAATCCAGA AGCTGACAGG AGATGAAAAC TTCACTATTC CATATTGGGA CTGGCGGGAT 720

GCAGAAAAGT GTGACATTTG CACAGATGAG TACATGGGAG GTCAGCACCC CACAAATCCT 780

AACTTACTCA GCCCAGCATC ATTCTTCTCC TCTTGGCAGA TTGTCTGTAG CCGATTGGAG 840

GAGTACAACA GCCATCAGTC TTTATGCAAT GGAACGCCCG AGGGACCTTT ACGGCGTAAT 900

CCTGGAAACC ATGACAAATC CAGAACCCCA AGGCTCCCCT CTTCAGCTGA TGTAGAATTT 960

TGCCTGAGTT TGACCCAATA TGAATCTGGT TCCATGGATA AAGCTGCCAA TTTCAGCTTT 1020

AGAAATACAC TGGAAGGATT TGCTAGTCCA CTTACTGGGA TAGCGGATGC CTCTCAAAGC 1080

AGCATGCACA ATGCCTTGCA CATCTATATG AATGGAACAA TGTCCCAGGT ACAGGGATCT 1140

GCCAACGATC CTATCTTCCT TCTTCACCAT GCATTTGTTG ACAGTATTTT TGAGCAGTGG 1200

CTCCGAAGGC ACCGTCCTCT TCAAGAAGTT TATCCAGAAG CCAATGCACC CATTGGACAT 1260

AACCGGGAAT CCTACATGGT TCCTTTTATA CCACTGTACA GAAATGGTGA TTTCTTTATT 1320

TCATCCAAAG ATCTGGGCTA TGACTATAGC TATCTACAAG ATTCAGACCC AGACTCTTTT 1380

CAAGACTACA TTAAGTCCTA TTTGGAACAA GCGAGTCGGA TCTGGTCATG GCTCCTTGGG 1440

GCGGCGATGG TAGGGGCCGT CCTCACTGCC CTGCTGGCAG GGCTTGTGAG CTTGCTGTGT 1500

CGTCACAAGA GAAAGCAGCT TCCTGAAGAA AAGCAGCCAC TCCTCATGGA GAAAGAGGAT 1560 

TACCACAGCT TGTATCAGAG CCATTTA 1587 
(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1587 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

ATGCTCCTGG CTGTTTTGTA CTGCCTGCTG TGGAGTTTCC AGACCTCCGC TGGCCATTTC 60 

CCTAGAGCCT GTGTCTCCTC TAAGAACCTG ATGGAGAAGG AATGCTGTCC ACCGTGGAGC 120

GGGGACAGGA GTCCCTGTGG CCAGCTTTCA GGCAGAGGTT CCTGTCAGAA TATCCTTCTG 180 

TCCAATGCAC CACTTGGGCC TCAATTTCCC TTCACAGGGG TGGATGACCG GGAGTCGTGG 240 

CCTTCCGTCT TTTATAATAG GACCTGCCAG TGCTCTGGCA ACTTCATGGG ATTCAACTGT 300 

GGAAACTGCA AGTTTGGCTT TTGGGGACCA AACTGCACAG AGAGACGACT CTTGGTGAGA 360 

AGAAACATCT TCGATTTGAG TGCCCCAGAG AAGGACAAAT TTTTTGCCTA CCTCACTTTA 420

GCAAAGCATA CCATCAGCTC AGACTATGTC ATCCCCATAG GGACCTATGG CCAAATGAAA 480

AATGGATCAA CACCCATGTT TAACGACATC AATATTTATG ACCTCTTTGT CTGGATGCAT 540

TATTATGTGT CAATGGATGC ACTGCTTGGG GGATATGAAA TCTGGAGAGA CATTGATTTT 600 

GCCCATGAAG CACCAGCTTT TCTGCCTTGG CATAGACTCT TCTTGTTGCG GTGGGAACAA 660 

GAAATCCAGA AGCTGACAGG AGATGAAAAC TTCACTATTC CATATTGGGA CTGGCGGGAT 720 

GCAGAAAAGT GTGACATTTG CACAGATGAG TACATGGGAG GTCAGCACCC CACAAATCCT 780 

AACTTACTCA GCCCAGCATC ATTCTTCTCC TCTTGGCAGA TTGTCTGTAG CCGATTGGAG 840

GAGTACAACA GCCATCAGTC TTTATGCAAT GGAACGCCCG AGGGACCTTT ACGGCGTAAT 900 

CCTGGAAACC ATGACAAATC CAGAACCCCA AGGCTCCCCT CTTCAGCTGA TGTAGAATTT 960 

TGCCTGAGTT TGACCCAATA TGAATCTGGT TCCATGGATA AAGCTGCCAA TTTCAGCTTT 1020

AGAAATACAC TGGAAGGATT TGCTAGTCCA CTTACTGGGA TAGCGGATGC CTCTCAAAGC 1080 

AGCATGCACA ATGCCTTGCA CATCTATATG AATGGAACAA TGTCCCAGGT ACAGGGATCT 1140 

GCCAACGATC CTATCTTCCT TCTTCACCAT GCATTTGTTG ACAGTATTTT TGAGCAGTGG 1200

CTCCGAAGGC ACCGTCCTCT TCAAGAAGTT TATCCAGAAG CCAATGCACC CATTGGACAT 1260 

AACCAGGAAT CCTACATGGT TCCTTTTATA CCACTGTACA GAAATGGTGA TTTCTTTATT 1320

TCATCCAAAG ATCTGGGCTA TGACTATAGC TATCTACAAG ATTCAGACCC AGACTCTTTT 1380

CAAGACTACA TTAAGTCCTA TTTGGAACAA GCGAGTCGGA TCTGGTCATG GCTCCTTGGG 1440 

GCGGCGATGG TAGGGGCCGT CCTCACTGCC CTGCTGGCAG GGCTTGTGAG CTTGCTGTGT 1500 

CGTCACAAGA GAAAGCAGCT TCCTGAAGAA AAGCAGCCAC TCCTCATGGA GAAAGAGGAT 1560 

TACCACAGCT TGTATCAGAG CCATTTA 1587 
(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
TAAATGGCTC TGATACAAGC T 21 
(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
GCAAGTTTGG CTTTTGGGGA 20 
(2) INFORMATION FOR SEQ ID NO: 64:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:64: 
ATGCTCCTGG CTGTTTTGTA CTG 23 
(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65:

CTGAATCTTG TAGATAGCTA TAGTCATAGC CCAGATCTTT GGATGAAATA AAGAAATCAC 60

CATTTCTGTA CAGTGGTATA AAAGGAACCA TGTAGGATTC CCGGTTATGT CTAATGGGTG 120

CATTGGCTTC TGGATAAACT TCTTGAAGAG GACGGTG 157
(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

CTGAATCTTG TAGATAGCTA TAGTCATAGC CCAGATCTTT GGATGAAATA AAGAAATCAC 60 

CATTTCTGTA CAGTGGTATA AAAGGAACCA TGTAGGATTC CTGGTTATGT CCAATGGGTG 120

CATTGGCTTC TGGATAAACT TCTTGAAGAG GACGGTG 157 
(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
GGTTGGCCAA TCTACTCCCA GG 22 
(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:68: 
GCTCACTCAG TGTGGCAAAG 20

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 536 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69:

GGTTGGCCAA TCTACTCCCA GGAGCAGGGA GGGCAGGAGC CAGGGCTGGG CATAAAAGTC 60

AGGGCAGAGC CATCTATTGC TTACATTTGC TTCTGACACA ACTGTGTTCA CTAGCAACCT 120

CAAACAGACA CCATGGTGCA CCTGACTCCT GAGGAGAAGT CTGCCGTTAC TGCCCTGTGG 180

GGCAAGGTGA ACGTGGATGA AGTTGGTGGT GAGGCCCTGG GCAGGTTGGT ATCAAGGTTA 240

CAAGACAGGT TTAAGGAGAC CAATAGAAAC TGGGCATGTG GAGACAGAGA AGACTCTTGG 300

GTTTCTGATA GGCACTGACT CTCTCTGCCT ATTGGTCTAT TTTCCCACCC TTAGGCTGCT 360

GGTGGTCTAC CCTTGGACCC AGAGGTTCTT TGAGTCCTTT GGGGATCTGT CCACTCCTGA 420

TGCTGTTATG GGCAACCCTA AGGTGAAGGC TCATGGCAAG AAAGTGCTCG GTGCCTTTAG 480

TGATGGCCTG GCTCACCTGG ACAACCTCAA GGGCACCTTT GCCACACTGA GTGAGC 536
(2) INFORMATION FOR SEQ ID NO: 70:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 534 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70:

GGTTGGCCAA TCTACTCCCA GGAGCAGGGA GGGCAGGAGC CAGGGCTGGG CATAAAAGTC 60

AGGGCAGAGC CATCTATTGC TTACATTTGC TTCTGACACA ACTGTGTTCA CTAGCAACCT 120

CAAACAGACA CCATGGTGCA TCTGACTCCT GAGGAGGTCT GCCGTTACTG CCCTGTGGGG 180

CAAGGTGAAC GTGGATGAAG TTGGTGGTGA GGCCCTGGGC AGGTTGGTAT CAAGGTTACA 240

AGACAGGTTT AAGGAGACCA ATAGAAACTG GGCATGTGGA GACAGAGAAG ACTCTTGGGT 300

TTCTGATAGG CACTGACTCT CTCTGCCTAT TGGTCTATTT TCCCACCCTT AGGCTGCTGG 360

TGGTCTACCC TTGGACCCAG AGGTTCTTTG AGTCCTTTGG GGATCTGTCC ACTCCTGATG 420

CTGTTATGGG CAACCCTAAG GTGAAGGCTC ATGGCAAGAA AGTGCTCGGT GCCTTTAGTG 480

ATGGCCTGGC TCACCTGGAC AACCTCAAGG GCACCTTTGC CACACTGAGT GAGC 534
(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 536 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71:

GGTTGGCCAA TCTACTCCCA GGAGCAGGGA GGGCAGGAGC CAGGGCTGGG CATAAAAGTC 60

AGGGCAGAGC CATCTATTGC TTACATTTGC TTCTGACACA ACTGTGTTCA CTAGCAACCT 120

CAAACAGACA CCATGGTGCA CCTGACTCCT GAGGAGAAGT CTGCCGTTAC TGCCCTGTGG 180 

GGCAAGGTGA ACGTGGATGA AGTTGGTGGT GAGGCCCTGG GCAGGTTGGT ATCAAGGTTA 240 

CAAGACAGGT TTAAGGAGAC CAATAGAAAC TGGGCATGTG GAGACAGAGA AGACTCTTGG 300 

GTTTCTGATA GGCACTGACT CTCTCTGCCT ATTGGTCTAT TTTCCCACCC TTAGGCTGCT 360 

GGTGGTCTAC CCTTGGACCT AGAGGTTCTT TGAGTCCTTT GGGGATCTGT CCACTCCTGA 420

TGCTGTTATG GGCAACCCTA AGGTGAAGGC TCATGGCAAG AAAGTGCTCG GTGCCTTTAG 480 

TGATGGCCTG GCTCACCTGG ACAACCTCAA GGGCACCTTT GCCACACTGA GTGAGC 536 
(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:72: 

GGTTGGCCAA TCTACTCCCA GGAGCAGGGA GGGCAGGAGC CAGGGCTGGG CATAAAAGTC 60 

AGGGCAGAGC CATCTATTGC TTACATTTGC TTCTGACACA ACTGTGTTCA CTAGCAACCT 120

CAAACAGACA CCATGGTGCA CCTGACTCCT GAGGAGAAGT CTGCCGTTAC TGCCCTGTGG 180 

GGCAAGGTGA ACGTGGATGA AGTTGGAGGT GAGGCCCTGG GCAGGTTGGT ATCAAGGTTA 240

CAAGACAGGT TTAAGGAGAC CAATAGAAAC TGGGCATGTG GAGACAGAGA AGACTCTTGG 300

GTTTCTGATA GGCACTGACT CTCTCTGCCT ATTGGTCTAT TTTCCCACCC TTAGGCTGCT 360 

GGTGGTCTAC CCTTGGACCC AGAGGTTCTT TGAGTCCTTT GGGGATCTGT CCACTCCTGA 420 

TGCTGTTATG GGCAACCCTA AGGTGAAGGC TCATGGCAAG AAAGTGCTCG GTGCCTTTAG 480 

TGATGGCCTG GCTCACCTGG ACAACCTCAA GGGCACCTTT GCCACACTGA GTGAGC 536
(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: RNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73:

GAAUACUCAA GCUUGCAUGC CUGCAGGUCG ACUCUAGAGG AUCCCCGGGU ACCGAGCUCG 60

AAUU 64
(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:74: 
GGCTGACAAG AAGGAAACTC 20 
(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:75: 
CCAGGCGGCG GCTAGGAGAG ATGGG 25 
(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 351 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76:

GGCTGACAAG AAGGAAACTC GCTGAGACAG CAGGGACTTT CCACAAGGGG ATGTTACGGG 60

GAGGTACTGG GGAGGAGCCG GTCGGGAACG CCCACTCTCT TGATGTATAA ATATCACTGC 120

ATTTCGCTCT GTATTCAGTC GCTCTGCGGA GAGGCTGGCA GATTGAGCCC TGGGAGGTTC 180

TCTCCAGCAC TAGCAGGTAG AGCCTGGGTG TTCCCTGCTA GACTCTCACC AGCACTTGGC 240

CGGTGCTGGG CAGAGTGGCT CCACGCTTGC TTGCTTAAAG ACCTCTTCAA TAAAGCTGCC 300

ATTTTAGAAG TAGGCCAGTG TGTGTTCCCA TCTCTCCTAG CCGCCGCCTG G 351
(2) INFORMATION FOR SEQ ID NO: 77:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

GGCTGACAAG AAGGAAACTC GCTGAGATAG CAGGGACTTT CCACAAGGGG ATGTTATGGG 60 

GAGGAGCCGG TCGGGAACAC CCACTTTCTT GATGTATAAA TATCACTGCA TTTCGCTCTG 120 

TATTCAGTCG CTCTGCGGAG AGGCTGGCAG ATTGAGCCCT GGGAGGTTCT CTCCAGCACT 180 

AGCAGGTAGA GCCTGGGTGT TCCCTGCTAG ACTCTCACCA GCACTTAGCC AGTGCTGGGC 240 

AGAGTGGCTC CACGCTTGCT TGCTTAAAGA CCTCTTCAAT AAAGCTGCCA TTTTAGAAGT 300 

AAGCCAGTGT GTGTTCCCAT CTCTCCTAGC CGCCGCCTGG 340 
(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:78: 

GGCTGACAAG AAGGAAACTC GCTGAGATAG CAGGGACTTT CCACAAGGGG ATGTTATGGG 60 

GAGGAGCCGG TCGGGAACAC CCACTTTCTT GGTGTATAAA TATCACTGCA TTTCGCTCTG 120

TATTCAGTCG CTCTGCGGAG AGGCTGGCAG ATTGAGCCCT GGGAGGTTCT CTCCAGCACT 180 

AGCAGGTAGA GCCTGGGTGT TCCCTGCTAG ACTCTCACCA GCACTTGGCC AGTGCTGGGC 240

AGAGTGGCTC CACGCTTGCT TGCTTAAAGA CCTCTTCAAT AAAGCTGCCA TTTTAGAAGT 300 

AAGCCAGTGT GTGTTCCCAT CTCTCCTAGC CGCCGCCTGG 340
(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 351 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79:

GGCTGACAAG AAGGAAACTC GCTGAGACAG CAGGGACTTT CCACAAGGGG [illegible] 60

GAGGTACTGG GGAGGAGCCG GTCGGGAACG CCCCCTCTCT TGATGTATAA ATATCACTGC 120

ATTTCGCTCT GTATTCAGTC GCTCTGCGGA GAGGCTGGCA GATTGAGCCC TGGGAGGTTC 180

TCTCCAGCAC TAGCAGGTAG AGCCTGGGTG TTCCCTGCTA GACTCTCACC AGCACTTGGC 240

CGGTGCTGGG CAGAGTGGCT CCACGCTTGC TTGCTTAAAG ACCTCTTCAA TAAAGCTGCC 300

ATTTTAGAAG TAGGCTAGTG TGTGTTCCCA TCTCTCCTAG CCGCCGCCTG G 351
(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 351 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80:

GGCTGACAAG AAGGAAACTC GCTGAAACAG CAGGGACTTT CCACAAGGGG ATGTTACGGG 60

GAGGTACTGG GAAGGAGCCG GTCGGGAACG CCCACTTTCT TGATGTATAA ATATCACTGC 120

ATTTCGCTCT GTATTCAGTC GCTCTGCGGA GAGGCTGGCA GATTGAGCCC TGGGAGGTTC 180

TCTCCAGCAC TAGCAGGTAG AGCCTGGGTG TTCCCTGCTA GACTCTCACC AGCACTTGGC 240

CGGTGCTGGG CAGAGTGACT CCACGCTTGC TTGCTTAAAG CCCTCTTCAA TAAAGCTGCC 300

ATTTTAGAAG TAAGCTAGTG TGTGTTCCCA TCTCTCCTAG CCGCCGCCTG G 351
(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 351 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81:

GGCTGACAAG AAGGAAACTC GCTGAGACAG CAGGGACTTT CCACAAGGGG ATGTTACGGA 60

GAGGTACTGG GGAGGAGCCG GTCGGGAACG CCCACTCTCT TGATGTATAA ATATCACTGC 120

ATTTCGCTCT GTATTCAGTC GCTCTGCGGA GAGGCTGGCA GATTGAGCCC TAGGAGGTTC 180

TCTCCAGCAC TAGCAGGTAG AGCCTGAGTG TTCCCTGCTA AACTCTCACC AGCACTTGGC 240

CGGTGCTGGG CAGAGCGGCT CCACGCTTGC TTGCTTAAAG ACCTCTTCAA TAAAGCTGCC 300

ATTTTAGAAG TAGGCTAGTG TGTGTTCCCA TCTCTCCTAG CCGCCGCCTG G 351
(2) INFORMATION FOR SEQ ID NO: 82:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82:

GGTTGGCCAA TCTACTCCCA GGAGCAGGGA GGGCAGGAGC CAGGGCTGGG CATAAAAGTC 60

AGGGCAGAGC CATCTATTGC TTACATTTGC TTCTGACACA ACTGTGTTCA CTAGCAACCT 120

CAAACAGACA CCATGGTGCA TCTGACTCCT GAGGAGAAGT CTGCCGTTAC TGCCCTGTGG 180

GGCAAGGTGA ACGTGGATGA AGTTGGTGGT AAGGCCCTGG GCAGGTTGGT ATCAAGGTTA 240

CAAGACAGGT TTAAGGAGAC CAATAGAAAC TGGGCATGTG GAGACAGAGA AGACTCTTGG 300

GTTTCTGATA GGCACTGACT CTCTCTGCCT ATTGGTCTAT TTTCCCACCC TTAGGCTGCT 360

GGTGGTCTAC CCTTGGACCC AGAGGTTCTT TGAGTCCTTT GGGGATCTGT CCACTCCTGA 420

TGCTGTTATG GGCAACCCTA AGGTGAAGGC TCATGGCAAG AAAGTGCTCG GTGCCTTTAG 480

TGATGGCCTG GCTCACCTGG ACAACCTCAA GGGCACCTTT GCCACACTGA GTGAGC 536
(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83:

GGTTGGCCAA TCTACTCCCA GGAGCAGGGA GGGCAGGAGC CAGGGCTGGG CATAAAAGTC 60

AGGGCAGAGC CATCTATTGC TTACATTTGC TTCTGACACA ACTGTGTTCA CTAGCAACCT 120

CAAACAGACA CCATGGTGCA CCTGACTCCT GAGGAGAAGT CTGCCGTTAC TGCCCTGTGG 180

GGCAAGGTGA ACGTGGATGA AGTTGGTGGT GAGGCCCTGG GCAGGTTGGT ATCAAGGTTA 240

CAAGACAGGT TTAAGGAGAC CAATAGAAAC TGGGCATGTG GAGACAGAGA AGACTCTTGG 300

GTTTCTGATA GGCACTGACT CTCTCTGCCT ATTAGTCTAT TTTCCCACCC TTAGGCTGCT 360

GGTGGTCTAC CCTTGGACCC AGAGGTTCTT TGAGTCCTTT GGGGATCTGT CCACTCCTGA 420

TGCTGTTATG GGCAACCCTA AGGTGAAGGC TCATGGCAAG AAAGTGCTCG GTGCCTTTAG 480

TGATGGCCTG GCTCACCTGG ACAACCTCAA GGGCACCTTT GCCACACTGA GTGAGC 536
(2) INFORMATION FOR SEQ ID NO: 84:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:84: 
CACCGTCCTC TTCAAGAAGT TTATCCAGAA GCCAATGCAC CCATTGGACA TAACCAGGAA 60 
TCCTACATGG TTCCTTTTAT ACCACTGTAC AGAAATGGTG ATTTCTTTAT TTCATCCAAA 120 
GATCTGGGCT ATGACTATAG CTATCTACAA GATTCAG 157 
(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
1 5 10 15

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly Phe
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val Ile
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gln
85 90 95 

Leu Ala Leu Ile Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg Ile Leu Thr Ala Asp Lys
130 135 140 

Asp Leu Tyr Gln Leu Leu Ser Asp Arg Ile His Val Leu His Pro Glu
145 150 155 160 

Gly Tyr Leu Ile Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg
165 170 175 
Pro Asp Gln Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp
180 185 190 

Asn Leu Pro Gly Val Lys Gly Ile Gly Glu Lys Thr Ala Arg Lys Leu
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala Ile Arg Glu Lys Ile Leu Ala His Met Asp Asp Leu
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Gly Glu Ile Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gln Leu Glu Arg Val Leu Phe
485 490 495

Asp Glu Leu Gly Leu Pro Ala Ile Gly Lys Thr Glu Lys Thr Gly Lys
500 505 510

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro
515 520 525

Ile Val Glu Lys Ile Leu Gln Tyr Arg Glu Leu Thr Lys Leu Lys Ser
530 535 540

Thr Tyr Ile Asp Pro Leu Pro Asp Leu Ile His Pro Arg Thr Gly Arg
545 550 555 560

Leu His Thr Arg Phe Asn Gln Thr Ala Thr Ala Thr Gly Arg Leu Ser
565 570 575

Ser Ser Asp Pro Asn Leu Gln Asn Ile Pro Val Arg Thr Pro Leu Gly
580 585 590

Gln Arg Ile Arg Arg Ala Phe Ile Ala Glu Glu Gly Trp Leu Leu Val
595 600 605

Ala Leu Asp Tyr Ser Gln Ile Glu Leu Arg Val Leu Ala His Leu Ser
610 615 620

Gly Asp Glu Asn Leu Ile Arg Val Phe Gln Glu Gly Arg Asp Ile His
625 630 635 640

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp
645 650 655

Pro Leu Met Arg Arg Ala Ala Lys Thr Ile Asn Phe Gly Val Leu Tyr
660 665 670

Gly Met Ser Ala His Arg Leu Ser Gln Glu Leu Ala Ile Pro Tyr Glu
675 680 685

Glu Ala Gln Ala Phe Ile Glu Arg Tyr Phe Gln Ser Phe Pro Lys Val
690 695 700

Arg Ala Trp Ile Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr
705 710 715 720

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala
725 730 735

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met
740 745 750

Pro Val Arg Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys
755 760 765

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gln Val
770 775 780

His Asp Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val
785 790 795 800

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val
805 810 815 

Pro Leu Glu Val Glu Val Gly Ile Gly Glu Asp Trp Leu Ser Ala Lys
820 825 830 

Glu 



(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 548 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu
1 5 10 15

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys
20 25 30

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly Phe
35 40 45

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val Ile
50 55 60

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly
65 70 75 80

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gln
85 90 95

Leu Ala Leu Ile Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu
100 105 110

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys
115 120 125

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg Ile Leu Thr Ala Asp Lys
130 135 140

Asp Leu Tyr Gln Leu Leu Ser Asp Arg Ile His Val Leu His Pro Glu
145 150 155 160

Gly Tyr Leu Ile Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg
165 170 175

Pro Asp Gln Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp
180 185 190

Asn Leu Pro Gly Val Lys Gly Ile Gly Glu Lys Thr Ala Arg Lys Leu
195 200 205

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg
210 215 220

Leu Lys Pro Ala Ile Arg Glu Lys Ile Leu Ala His Met Asp Asp Leu
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Gly Glu Ile Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly
465 470 475 480

His Pro Phe Asn Leu Asn Ser Arg Asp Gln Leu Glu Arg Val Leu Phe
485 490 495

Asp Glu Leu Gly Leu Pro Ala Ile Gly Lys Thr Glu Lys Thr Gly Lys
500 505 510

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro
515 520 525

Ile Val Glu Lys Ile Leu Gln Ala Cys Lys Leu Gly Thr Gly Arg Arg
530 535 540

Phe Thr Thr Ser
545

(2) INFORMATION FOR SEQ ID NO: 87:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu
1 5 10 15

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys
20 25 30

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly Phe
35 40 45

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val Ile
50 55 60

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly
65 70 75 80

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gln
85 90 95

Leu Ala Leu Ile Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu
100 105 110

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys
115 120 125

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg Ile Leu Thr Ala Asp Lys
130 135 140

Asp Leu Tyr Gln Leu Leu Ser Asp Arg Ile His Val Leu His Pro Glu
145 150 155 160

Gly Tyr Leu Ile Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg
165 170 175

Pro Asp Gln Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp
180 185 190

Asn Leu Pro Gly Val Lys Gly Ile Gly Glu Lys Thr Ala Arg Lys Leu
195 200 205

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg
210 215 220

Leu Lys Pro Ala Ile Arg Glu Lys Ile Leu Ala His Met Asp Asp Leu
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala
260 265 270

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Gly Glu Ile Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly
465 470 475 480

His Pro Phe Asn Leu Asn Ser Arg Asp Gln Leu Glu Arg Val Leu Phe
485 490 495

Asp Glu Leu Gly Leu Pro Ala Ile Gly Lys Thr Glu Lys Thr Gly Lys
500 505 510

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro
515 520 525

Ile Val Glu Lys Ile Leu Gln Tyr Arg Glu Leu Thr Lys Leu Lys Ser
530 535 540

Thr Tyr Ile Asp Pro Leu Pro Asp Leu Ile His Pro Arg Thr Gly Arg
545 550 555 560

Leu His Thr Arg Phe Asn Gln Thr Ala Thr Ala Thr Gly Arg Leu Ser
565 570 575

Ser Ser Asp Pro Asn Leu Gln Asn Ile Pro Val Arg Thr Pro Leu Gly
580 585 590

Gln Arg Ile Arg Arg Ala Phe Ile Ala Glu Glu Gly Trp Leu Leu Val
595 600 605

Ala Leu Asp Tyr Ser Gln Ile Glu Leu Arg Val Leu Ala His Leu Ser
610 615 620

Gly Asp Glu Asn Leu Ile Arg Val Phe Gln Glu Gly Arg Asp Ile His
625 630 635 640

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp
645 650 655

Pro Leu Met Arg Arg Ala Ala Lys Thr Ile Asn Phe Gly Val Leu Tyr
660 665 670

Gly Met Ser Ala His Arg Leu Ser Gln Glu Leu Ala Ser His Pro Leu
675 680 685 

Arg Gly Gly Pro Gly Leu His 
690 695 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 310 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu
1 5 10 15

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys
20 25 30

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly Phe
35 40 45

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val Ile
50 55 60

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly
65 70 75 80

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gln
85 90 95

Leu Ala Leu Ile Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu
100 105 110

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys
115 120 125

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg Ile Leu Thr Ala Asp Lys
130 135 140

Asp Leu Tyr Gln Leu Leu Ser Asp Arg Ile His Val Leu His Pro Glu
145 150 155 160

Gly Tyr Leu Ile Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg
165 170 175

Pro Asp Gln Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp
180 185 190

Asn Leu Pro Gly Val Lys Gly Ile Gly Glu Lys Thr Ala Arg Lys Leu
195 200 205

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg
210 215 220

Leu Lys Pro Ala Ile Arg Glu Lys Ile Leu Ala His Met Asp Asp Leu
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ser Trp Arg Gly Cys Ile Pro Trp Pro Cys Pro
290 295 300 

Trp Arg Trp Arg Trp Gly 
305 310 

(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 322 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg Ile Asn Ser Gly
1 5 10 15

Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val Asp Gly
20 25 30

His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys Gly Leu Thr Thr
35 40 45

Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly Phe Ala Lys Ser Leu
50 55 60

Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val Ile Val Val Phe Asp
65 70 75 80

Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly Gly Tyr Lys Ala
85 90 95

Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gln Leu Ala Leu Ile
100 105 110

Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu Glu Val Pro Gly
115 120 125

Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys Lys Ala Glu Lys
130 135 140

Glu Gly Tyr Glu Val Arg Ile Leu Thr Ala Asp Lys Asp Leu Tyr Gln
145 150 155 160

Leu Leu Ser Asp Arg Ile His Val Leu His Pro Glu Gly Tyr Leu Ile
165 170 175

Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro Asp Gln Trp
180 185 190

Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp Asn Leu Pro Gly
195 200 205

Val Lys Gly Ile Gly Glu Lys Thr Ala Arg Lys Leu Leu Glu Glu Trp
210 215 220

Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu Lys Pro Ala
225 230 235 240

Ile Arg Glu Lys Ile Leu Ala His Met Asp Asp Leu Lys Leu Ser Trp
245 250 255 

Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu Val Asp Phe Ala 
260 265 270 

Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala Phe Leu Glu Arg 
275 280 285 

Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu Glu Ser Pro 
290 295 300 

Lys Ser Trp Arg Gly Cys Ile Pro Trp Pro Cys Pro Trp Arg Trp Arg
305 310 315 320 

Trp Gly 

(2) INFORMATION FOR SEQ ID NO: 90:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 528 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val Asp Gly
1 5 10 15

His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys Gly Leu Thr Thr
20 25 30

Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly Phe Ala Lys Ser Leu
35 40 45

Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val Ile Val Val Phe Asp
50 55 60

Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly Gly Tyr Lys Ala
65 70 75 80

Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gln Leu Ala Leu Ile
85 90 95

Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu Glu Val Pro Gly
100 105 110

Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys Lys Ala Glu Lys
115 120 125

Glu Gly Tyr Glu Val Arg Ile Leu Thr Ala Asp Lys Asp Leu Tyr Gln
130 135 140

Leu Leu Ser Asp Arg Ile His Val Leu His Pro Glu Gly Tyr Leu Ile
145 150 155 160

Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro Asp Gln Trp
165 170 175

Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp Asn Leu Pro Gly
180 185 190

Val Lys Gly Ile Gly Glu Lys Thr Ala Arg Lys Leu Leu Glu Glu Trp
195 200 205

Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu Lys Pro Ala
210 215 220

Ile Arg Glu Lys Ile Leu Ala His Met Asp Asp Leu Lys Leu Ser Trp
225 230 235 240 

Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu Val Asp Phe Ala 
245 250 255 

Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala Phe Leu Glu Arg 
260 265 270 

Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu Glu Ser Pro 
275 280 285 

Lys Ile Arg Arg Ala Phe Ile Ala Glu Glu Gly Trp Leu Leu Val Ala
290 295 300

Leu Asp Tyr Ser Gln Ile Glu Leu Arg Val Leu Ala His Leu Ser Gly
305 310 315 320

Asp Glu Asn Leu Ile Arg Val Phe Gln Glu Gly Arg Asp Ile His Thr
325 330 335

Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp Pro
340 345 350

Leu Met Arg Arg Ala Ala Lys Thr Ile Asn Phe Gly Val Leu Tyr Gly
355 360 365

Met Ser Ala His Arg Leu Ser Gln Glu Leu Ala Ile Pro Tyr Glu Glu
370 375 380

Ala Gln Ala Phe Ile Glu Arg Tyr Phe Gln Ser Phe Pro Lys Val Arg
385 390 395 400

Ala Trp Ile Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val
405 410 415 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala Arg 
420 425 430 

Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 
435 440 445 

Val Arg Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 
450 455 460 

Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gln Val His
465 470 475 480 

Asp Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val Ala 
485 490 495 

Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val Pro 
500 505 510 

Leu Glu Val Glu Val Gly Ile Gly Glu Asp Trp Leu Ser Ala Lys Glu
515 520 525 



(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 315 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg Ile Asn Ser Gly
1 5 10 15

Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val Asp Gly
20 25 30

His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys Gly Leu Thr Thr
35 40 45

Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly Phe Ala Lys Ser Leu
50 55 60

Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val Ile Val Val Phe Asp
65 70 75 80

Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly Gly Tyr Lys Ala
85 90 95

Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gln Leu Ala Leu Ile
100 105 110

Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu Glu Val Pro Gly
115 120 125

Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys Lys Ala Glu Lys
130 135 140

Glu Gly Tyr Glu Val Arg Ile Leu Thr Ala Asp Lys Asp Leu Tyr Gln
145 150 155 160

Leu Leu Ser Asp Arg Ile His Val Leu His Pro Glu Gly Tyr Leu Ile
165 170 175

Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro Asp Gln Trp
180 185 190

Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp Asn Leu Pro Gly
195 200 205

Val Lys Gly Ile Gly Glu Lys Thr Ala Arg Lys Leu Leu Glu Glu Trp
210 215 220

Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu Lys Pro Ala
225 230 235 240

Ile Arg Glu Lys Ile Leu Ala His Met Asp Asp Leu Lys Leu Ser Trp
245 250 255 

Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu Val Asp Phe Ala 
260 265 270 

Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala Phe Leu Glu Arg 
275 280 285 

Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu Glu Ser Pro 
290 295 300 

Lys Ala Ala Leu Glu His His His His His His 
305 310 315 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1182 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 



ATGGAGGAGC CGCAGTCAGA TCCTAGCGTC GAGCCCCCTC TGAGTCAGGA AACATTTTCA 60

GACCTATGGA AACTACTTCC TGAAAACAAC GTTCTGTCCC CCTTGCCGTC CCAAGCAATG 120

GATGATTTGA TGCTGTCCCC GGACGATATT GAACAATGGT TCACTGAAGA CCCAGGTCCA 180

GATGAAGCTC CCAGAATGCC AGAGGCTGCT CCCCCCGTGG CCCCTGCACC AGCAGCTCCT 240

ACACCGGCGG CCCCTGCACC AGCCCCCTCC TGGCCCCTGT CATCTTCTGT CCCTTCCCAG 300

AAAACCTACC AGGGCAGCTA CGGTTTCCGT CTGGGCTTCT TGCATTCTGG GACAGCCAAG 360

TCTGTGACTT GCACGTACTC CCCTGCCCTC AACAAGATGT TTTGCCAACT GGCCAAGACC 420

TGCCCTGTGC AGCTGTGGGT TGATTCCACA CCCCCGCCCG GCACCCGCGT CCGCGCCATG 480

GCCATCTACA AGCAGTCACA GCACATGACG GAGGTTGTGA GGCGCTGCCC CCACCATGAG 540

CGCTGCTCAG ATAGCGATGG TCTGGCCCCT CCTCAGCATC TTATCCGAGT GGAAGGAAAT 600

TTGCGTGTGG AGTATTTGGA TGACAGAAAC ACTTTTCGAC ATAGTGTGGT GGTGCCCTAT 660

GAGCCGCCTG AGGTTGGCTC TGACTGTACC ACCATCCACT ACAACTACAT GTGTAACAGT 720

TCCTGCATGG GCGGCATGAA CCGGAGGCCC ATCCTCACCA TCATCACACT GGAAGACTCC 780

AGTGGTAATC TACTGGGACG GAACAGCTTT GAGGTGCGTG TTTGTGCCTG TCCTGGGAGA 840

GACCGGCGCA CAGAGGAAGA GAATCTCCGC AAGAAAGGGG AGCCTCACCA CGAGCTGCCC 900

CCAGGGAGCA CTAAGCGAGC ACTGCCCAAC AACACCAGCT CCTCTCCCCA GCCAAAGAAG 960

AAACCACTGG ATGGAGAATA TTTCACCCTT CAGATCCGTG GGCGTGAGCG CTTCGAGATG 1020

TTCCGAGAGC TGAATGAGGC CTTGGAACTC AAGGATGCCC AGGCTGGGAA GGAGCCAGGG 1080

GGGAGCAGGG CTCACTCCAG CCACCTGAAG TCCAAAAAGG GTCAGTCTAC CTCCCGCCAT 1140

AAAAAACTCA TGTTCAAGAC AGAAGGGCCT GACTCAGACT GA 1182



(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1182 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

ATGGAGGAGC CGCAGTCAGA TCCTAGCGTC GAGCCCCCTC TGAGTCAGGA AACATTTTCA 60 

GACCTATGGA AACTACTTCC TGAAAACAAC GTTCTGTCCC CCTTGCCGTC CCAAGCAATG 120 

GATGATTTGA TGCTGTCCCC GGACGATATT GAACAATGGT TCACTGAAGA CCCAGGTCCA 180 

GATGAAGCTC CCAGAATGCC AGAGGCTGCT CCCCCCGTGG CCCCTGCACC AGCAGCTCCT 240 

ACACCGGCGG CCCCTGCACC AGCCCCCTCC TGGCCCCTGT CATCTTCTGT CCCTTCCCAG 300 

AAAACCTACC AGGGCAGCTA CGGTTTCCGT CTGGGCTTCT TGCATTCTGG GACAGCCAAG 360 

TCTGTGACTT GCACGTACTC CCCTGCCCTC AACAAGATGT TTTGCCAACT GGCCAAGACC 420

TGCCCTGCGC AGCTGTGGGT TGATTCCACA CCCCCGCCCG GCACCCGCGT CCGCGCCATG 480

GCCATCTACA AGCAGTCACA GCACATGACG GAGGTTGTGA GGCGCTGCCC CCACCATGAG 540 

CGCTGCTCAG ATAGCGATGG TCTGGCCCCT CCTCAGCATC TTATCCGAGT GGAAGGAAAT 600 

TTGCGTGTGG AGTATTTGGA TGACAGAAAC ACTTTTCGAC ATAGTGTGGT GGTGCCCTAT 660 

GAGCCGCCTG AGGTTGGCTC TGACTGTACC ACCATCCACT ACAACTACAT GTGTAACAGT 720

TCCTGCATGG GCGGCATGAA CCGGAGGCCC ATCCTCACCA TCATCACACT GGAAGACTCC 780

AGTGGTAATC TACTGGGACG GAACAGCTTT GAGGTGCGTG TTTGTGCCTG TCCTGGGAGA 840 

GACCGGCGCA CAGAGGAAGA GAATCTCCGC AAGAAAGGGG AGCCTCACCA CGAGCTGCCC 900 

CCAGGGAGCA CTAAGCGAGC ACTGCCCAAC AACACCAGCT CCTCTCCCCA GCCAAAGAAG 960 

AAACCACTGG ATGGAGAATA TTTCACCCTT CAGATCCGTG GGCGTGAGCG CTTCGAGATG 1020 

TTCCGAGAGC TGAATGAGGC CTTGGAACTC AAGGATGCCC AGGCTGGGAA GGAGCCAGGG 1080 

GGGAGCAGGG CTCACTCCAG CCACCTGAAG TCCAAAAAGG GTCAGTCTAC CTCCCGCCAT 1140 

AAAAAACTCA TGTTCAAGAC AGAAGGGCCT GACTCAGACT GA 1182 
(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1182 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

ATGGAGGAGC CGCAGTCAGA TCCTAGCGTC GAGCCCCCTC TGAGTCAGGA AACATTTTCA 60 

GACCTATGGA AACTACTTCC TGAAAACAAC GTTCTGTCCC CCTTGCCGTC CCAAGCAATG 120 

GATGATTTGA TGCTGTCCCC GGACGATATT GAACAATGGT TCACTGAAGA CCCAGGTCCA 180 

GATGAAGCTC CCAGAATGCC AGAGGCTGCT CCCCCCGTGG CCCCTGCACC AGCAGCTCCT 240 

ACACCGGCGG CCCCTGCACC AGCCCCCTCC TGGCCCCTGT CATCTTCTGT CCCTTCCCAG 300 

AAAACCTACC AGGGCAGCTA CGGTTTCCGT CTGGGCTTCT TGCATTCTGG GACAGCCAAG 360 

TCTGTGACTT GCACGTACTC CCCTGCCCTC AACAAGATGT TTTGCCAACT GGCCAAGACC 420

TGCCCTGTGC AGCTGTGGGT TGATTCCACA CCCCCGCCCG GCACCCGCGT CCGCGCCATG 480

GCCATCTACA AGCAGTCACA GCACATGACG GAGGTTGTGA GGCGCTGCCC CCACCATGAG 540

CGCTGCTCAG ATAGCGATGG TCTGGCCCCT CCTCAGCATC TTATCCGAGT GGAAGGAAAT 600 

TTGCGTGTGG AGTATTTGGA TGACAGAAAC ACTTTTCGAC ATAGTGTGGT GGTGCCCTAT 660 

GAGCCGCCTG AGGTTGGCTC TGACTGTACC ACCATCCACT ACAACTACAT GTGTAACAGT 720

TCCTGCATGG GCGGCATGAA CCGGAGACCC ATCCTCACCA TCATCACACT GGAAGACTCC 780 

AGTGGTAATC TACTGGGACG GAACAGCTTT GAGGTGCGTG TTTGTGCCTG TCCTGGGAGA 840 

GACCGGCGCA CAGAGGAAGA GAATCTCCGC AAGAAAGGGG AGCCTCACCA CGAGCTGCCC 900 

CCAGGGAGCA CTAAGCGAGC ACTGCCCAAC AACACCAGCT CCTCTCCCCA GCCAAAGAAG 960 

AAACCACTGG ATGGAGAATA TTTCACCCTT CAGATCCGTG GGCGTGAGCG CTTCGAGATG 1020

TTCCGAGAGC TGAATGAGGC CTTGGAACTC AAGGATGCCC AGGCTGGGAA GGAGCCAGGG 1080

GGGAGCAGGG CTCACTCCAG CCACCTGAAG TCCAAAAAGG GTCAGTCTAC CTCCCGCCAT 1140

AAAAAACTCA TGTTCAAGAC AGAAGGGCCT GACTCAGACT GA 1182



(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 
TCTGGGCTTC TTGCATTCTG 20 
(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
GTTGGGCAGT GCTCGCTTAG 20

(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 



TCTGGGCTTC TTGCATTCTG GGACAGCCAA GTCTGTGACT TGCACGTACT CCCCTGCCCT 60

CAACAAGATG TTTTGCCAAC TGGCCAAGAC CTGCCCTGTG CAGCTGTGGG TTGATTCCAC 120

ACCCCCGCCC GGCACCCGCG TCCGCGCCAT GGCCATCTAC AAGCAGTCAC AGCACATGAC 180

GGAGGTTGTG AGGCGCTGCC CCCACCATGA GCGCTGCTCA GATAGCGATG GTCTGGCCCC 240

TCCTCAGCAT CTTATCCGAG TGGAAGGAAA TTTGCGTGTG GAGTATTTGG ATGACAGAAA 300

CACTTTTCGA CATAGTGTGG TGGTGCCCTA TGAGCCGCCT GAGGTTGGCT CTGACTGTAC 360

CACCATCCAC TACAACTACA TGTGTAACAG TTCCTGCATG GGCGGCATGA ACCGGAGGCC 420

CATCCTCACC ATCATCACAC TGGAAGACTC CAGTGGTAAT CTACTGGGAC GGAACAGCTT 480

TGAGGTGCGT GTTTGTGCCT GTCCTGGGAG AGACCGGCGC ACAGAGGAAG AGAATCTCCG 540

CAAGAAAGGG GAGCCTCACC ACGAGCTGCC CCCAGGGAGC ACTAAGCGAG CACTGCCCAA 600 

C 601 
(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 



GTTGGGCAGT GCTCGCTTAG TGCTCCCTGG GGGCAGCTCG TGGTGAGGCT CCCCTTTCTT 60

GCGGAGATTC TCTTCCTCTG TGCGCCGGTC TCTCCCAGGA CAGGCACAAA CACGCACCTC 120

AAAGCTGTTC CGTCCCAGTA GATTACCACT GGAGTCTTCC AGTGTGATGA TGGTGAGGAT 180

GGGCCTCCGG TTCATGCCGC CCATGCAGGA ACTGTTACAC ATGTAGTTGT AGTGGATGGT 240

GGTACAGTCA GAGCCAACCT CAGGCGGCTC ATAGGGCACC ACCACACTAT GTCGAAAAGT 300

GTTTCTGTCA TCCAAATACT CCACACGCAA ATTTCCTTCC ACTCGGATAA GATGCTGAGG 360

AGGGGCCAGA CCATCGCTAT CTGAGCAGCG CTCATGGTGG GGGCAGCGCC TCACAACCTC 420

CGTCATGTGC TGTGACTGCT TGTAGATGGC CATGGCGCGG ACGCGGGTGC CGGGCGGGGG 480

TGTGGAATCA ACCCACAGCT GCACAGGGCA GGTCTTGGCC AGTTGGCAAA ACATCTTGTT 540

GAGGGCAGGG GAGTACGTGC AAGTCACAGA CTTGGCTGTC CCAGAATGCA AGAAGCCCAG 600



A 601 
(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

TCTGGGCTTC TTGCATTCTG GGACAGCCAA GTCTGTGACT TGCACGTACT CCCCTGCCCT 60 

CAACAAGATG TTTTGCCAAC TGGCCAAGAC CTGCCCTGCG CAGCTGTGGG TTGATTCCAC 120 

ACCCCCGCCC GGCACCCGCG TCCGCGCCAT GGCCATCTAC AAGCAGTCAC AGCACATGAC 180 

GGAGGTTGTG AGGCGCTGCC CCCACCATGA GCGCTGCTCA GATAGCGATG GTCTGGCCCC 240

TCCTCAGCAT CTTATCCGAG TGGAAGGAAA TTTGCGTGTG GAGTATTTGG ATGACAGAAA 300

CACTTTTCGA CATAGTGTGG TGGTGCCCTA TGAGCCGCCT GAGGTTGGCT CTGACTGTAC 360

CACCATCCAC TACAACTACA TGTGTAACAG TTCCTGCATG GGCGGCATGA ACCGGAGGCC 420 

CATCCTCACC ATCATCACAC TGGAAGACTC CAGTGGTAAT CTACTGGGAC GGAACAGCTT 480

TGAGGTGCGT GTTTGTGCCT GTCCTGGGAG AGACCGGCGC ACAGAGGAAG AGAATCTCCG 540

CAAGAAAGGG GAGCCTCACC ACGAGCTGCC CCCAGGGAGC ACTAAGCGAG CACTGCCCAA 600 

C 601 
(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:100: 



GTTGGGCAGT GCTCGCTTAG TGCTCCCTGG GGGCAGCTCG TGGTGAGGCT CCCCTTTCTT 60

GCGGAGATTC TCTTCCTCTG TGCGCCGGTC TCTCCCAGGA CAGGCACAAA CACGCACCTC 120

AAAGCTGTTC CGTCCCAGTA GATTACCACT GGAGTCTTCC AGTGTGATGA TGGTGAGGAT 180

GGGCCTCCGG TTCATGCCGC CCATGCAGGA ACTGTTACAC ATGTAGTTGT AGTGGATGGT 240

GGTACAGTCA GAGCCAACCT CAGGCGGCTC ATAGGGCACC ACCACACTAT GTCGAAAAGT 300

GTTTCTGTCA TCCAAATACT CCACACGCAA ATTTCCTTCC ACTCGGATAA GATGCTGAGG 360

AGGGGCCAGA CCATCGCTAT CTGAGCAGCG CTCATGGTGG GGGCAGCGCC TCACAACCTC 420

CGTCATGTGC TGTGACTGCT TGTAGATGGC CATGGCGCGG ACGCGGGTGC CGGGCGGGGG 480

TGTGGAATCA ACCCACAGCT GCGCAGGGCA GGTCTTGGCC AGTTGGCAAA ACATCTTGTT 540

GAGGGCAGGG GAGTACGTGC AAGTCACAGA CTTGGCTGTC CCAGAATGCA AGAAGCCCAG 600

A 601



(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:101: 
TCTGGGCTTC TTGCATTCTG GGACAGCCAA GTCTGTGACT TGCACGTACT CCCCTGCCCT 60 
CAACAAGATG TTTTGCCAAC TGGCCAAGAC CTGCCCTGTG CAGCTGTGGG TTGATTCCAC 120

ACCCCCGCCC GGCACCCGCG TCCGCGCCAT GGCCATCTAC AAGCAGTCAC AGCACATGAC 180

GGAGGTTGTG AGGCGCTGCC CCCACCATGA GCGCTGCTCA GATAGCGATG GTCTGGCCCC 240

TCCTCAGCAT CTTATCCGAG TGGAAGGAAA TTTGCGTGTG GAGTATTTGG ATGACAGAAA 300

CACTTTTCGA CATAGTGTGG TGGTGCCCTA TGAGCCGCCT GAGGTTGGCT CTGACTGTAC 360

CACCATCCAC TACAACTACA TGTGTAACAG TTCCTGCATG GGCGGCATGA ACCGGAGACC 420

CATCCTCACC ATCATCACAC TGGAAGACTC CAGTGGTAAT CTACTGGGAC GGAACAGCTT 480

TGAGGTGCGT GTTTGTGCCT GTCCTGGGAG AGACCGGCGC ACAGAGGAAG AGAATCTCCG 540

CAAGAAAGGG GAGCCTCACC ACGAGCTGCC CCCAGGGAGC ACTAAGCGAG CACTGCCCAA 600

C 601



(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:102: 



GTTGGGCAGT GCTCGCTTAG TGCTCCCTGG GGGCAGCTCG TGGTGAGGCT CCCCTTTCTT 60

GCGGAGATTC TCTTCCTCTG TGCGCCGGTC TCTCCCAGGA CAGGCACAAA CACGCACCTC 120

AAAGCTGTTC CGTCCCAGTA GATTACCACT GGAGTCTTCC AGTGTGATGA TGGTGAGGAT 180

GGGTCTCCGG TTCATGCCGC CCATGCAGGA ACTGTTACAC ATGTAGTTGT AGTGGATGGT 240

GGTACAGTCA GAGCCAACCT CAGGCGGCTC ATAGGGCACC ACCACACTAT GTCGAAAAGT 300

GTTTCTGTCA TCCAAATACT CCACACGCAA ATTTCCTTCC ACTCGGATAA GATGCTGAGG 360

AGGGGCCAGA CCATCGCTAT CTGAGCAGCG CTCATGGTGG GGGCAGCGCC TCACAACCTC 420

CGTCATGTGC TGTGACTGCT TGTAGATGGC CATGGCGCGG ACGCGGGTGC CGGGCGGGGG 480

TGTGGAATCA ACCCACAGCT GCACAGGGCA GGTCTTGGCC AGTTGGCAAA ACATCTTGTT 540

GAGGGCAGGG GAGTACGTGC AAGTCACAGA CTTGGCTGTC CCAGAATGCA AGAAGCCCAG 600

A 601



(2) INFORMATION FOR SEQ ID NO:103: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:103:
GAGGATGGGA CTCCGGTTCA TG 22



(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 
CATGAACCGG AGTCCCATCC TCAC 24 
(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
GCACAAACAT GCACCTCAAA GCT 23 
(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
CAGCTTTGAG GTGCATGTTT GT 22 
(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107:

TCTGGGCTTC TTGCATTCTG GGACAGCCAA GTCTGTGACT TGCACGTACT CCCCTGCCCT 60

CAACAAGATG TTTTGCCAAC TGGCCAAGAC CTGCCCTGTG CAGCTGTGGG TTGATTCCAC 120

ACCCCCGCCC GGCACCCGCG TCCGCGCCAT GGCCATCTAC AAGCAGTCAC AGCACATGAC 180

GGAGGTTGTG AGGCGCTGCC CCCACCATGA GCGCTGCTCA GATAGCGATG GTCTGGCCCC 240

TCCTCAGCAT CTTATCCGAG TGGAAGGAAA TTTGCGTGTG GAGTATTTGG ATGACAGAAA 300

CACTTTTCGA CATAGTGTGG TGGTGCCCTA TGAGCCGCCT GAGGTTGGCT CTGACTGTAC 360

CACCATCCAC TACAACTACA TGTGTAACAG TTCCTGCATG GGCGGCATGA ACCGGAGTCC 420

CATCCTCACC ATCATCACAC TGGAAGACTC CAGTGGTAAT CTACTGGGAC GGAACAGCTT 480

TGAGGTGCGT GTTTGTGCCT GTCCTGGGAG AGACCGGCGC ACAGAGGAAG AGAATCTCCG 540

CAAGAAAGGG GAGCCTCACC ACGAGCTGCC CCCAGGGAGC ACTAAGCGAG CACTGCCCAA 600

C 601



(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:108: 



GTTGGGCAGT GCTCGCTTAG TGCTCCCTGG GGGCAGCTCG TGGTGAGGCT CCCCTTTCTT 60

GCGGAGATTC TCTTCCTCTG TGCGCCGGTC TCTCCCAGGA CAGGCACAAA CACGCACCTC 120

AAAGCTGTTC CGTCCCAGTA GATTACCACT GGAGTCTTCC AGTGTGATGA TGGTGAGGAT 180

GGGACTCCGG TTCATGCCGC CCATGCAGGA ACTGTTACAC ATGTAGTTGT AGTGGATGGT 240

GGTACAGTCA GAGCCAACCT CAGGCGGCTC ATAGGGCACC ACCACACTAT GTCGAAAAGT 300

GTTTCTGTCA TCCAAATACT CCACACGCAA ATTTCCTTCC ACTCGGATAA GATGCTGAGG 360

AGGGGCCAGA CCATCGCTAT CTGAGCAGCG CTCATGGTGG GGGCAGCGCC TCACAACCTC 420

CGTCATGTGC TGTGACTGCT TGTAGATGGC CATGGCGCGG ACGCGGGTGC CGGGCGGGGG 480

TGTGGAATCA ACCCACAGCT GCACAGGGCA GGTCTTGGCC AGTTGGCAAA ACATCTTGTT 540

GAGGGCAGGG GAGTACGTGC AAGTCACAGA CTTGGCTGTC CCAGAATGCA AGAAGCCCAG 600

A 601
(2) INFORMATION FOR SEQ ID NO: 109:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 



TCTGGGCTTC TTGCATTCTG GGACAGCCAA GTCTGTGACT TGCACGTACT CCCCTGCCCT 60

CAACAAGATG TTTTGCCAAC TGGCCAAGAC CTGCCCTGTG CAGCTGTGGG TTGATTCCAC 120

ACCCCCGCCC GGCACCCGCG TCCGCGCCAT GGCCATCTAC AAGCAGTCAC AGCACATGAC 180

GGAGGTTGTG AGGCGCTGCC CCCACCATGA GCGCTGCTCA GATAGCGATG GTCTGGCCCC 240

TCCTCAGCAT CTTATCCGAG TGGAAGGAAA TTTGCGTGTG GAGTATTTGG ATGACAGAAA 300

CACTTTTCGA CATAGTGTGG TGGTGCCCTA TGAGCCGCCT GAGGTTGGCT CTGACTGTAC 360

CACCATCCAC TACAACTACA TGTGTAACAG TTCCTGCATG GGCGGCATGA ACCGGAGGCC 420

CATCCTCACC ATCATCACAC TGGAAGACTC CAGTGGTAAT CTACTGGGAC GGAACAGCTT 480

TGAGGTGCAT GTTTGTGCCT GTCCTGGGAG AGACCGGCGC ACAGAGGAAG AGAATCTCCG 540

CAAGAAAGGG GAGCCTCACC ACGAGCTGCC CCCAGGGAGC ACTAAGCGAG CACTGCCCAA 600

C 601
(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

GTTGGGCAGT GCTCGCTTAG TGCTCCCTGG GGGCAGCTCG TGGTGAGGCT CCCCTTTCTT 60

GCGGAGATTC TCTTCCTCTG TGCGCCGGTC TCTCCCAGGA CAGGCACAAA CATGCACCTC 120

AAAGCTGTTC CGTCCCAGTA GATTACCACT GGAGTCTTCC AGTGTGATGA TGGTGAGGAT 180

GGGCCTCCGG TTCATGCCGC CCATGCAGGA ACTGTTACAC ATGTAGTTGT AGTGGATGGT 240

GGTACAGTCA GAGCCAACCT CAGGCGGCTC ATAGGGCACC ACCACACTAT GTCGAAAAGT 300

GTTTCTGTCA TCCAAATACT CCACACGCAA ATTTCCTTCC ACTCGGATAA GATGCTGAGG 360

AGGGGCCAGA CCATCGCTAT CTGAGCAGCG CTCATGGTGG GGGCAGCGCC TCACAACCTC 420

CGTCATGTGC TGTGACTGCT TGTAGATGGC CATGGCGCGG ACGCGGGTGC CGGGCGGGGG 480

TGTGGAATCA ACCCACAGCT GCACAGGGCA GGTCTTGGCC AGTTGGCAAA ACATCTTGTT 540

GAGGGCAGGG GAGTACGTGC AAGTCACAGA CTTGGCTGTC CCAGAATGCA AGAAGCCCAG 600

A 601



(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 427 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

TCTGGGCTTC TTGCATTCTG GGACAGCCAA GTCTGTGACT TGCACGTACT CCCCTGCCCT 60 

CAACAAGATG TTTTGCCAAC TGGCCAAGAC CTGCCCTGTG CAGCTGTGGG TTGATTCCAC 120

ACCCCCGCCC GGCACCCGCG TCCGCGCCAT GGCCATCTAC AAGCAGTCAC AGCACATGAC 180 

GGAGGTTGTG AGGCGCTGCC CCCACCATGA GCGCTGCTCA GATAGCGATG GTCTGGCCCC 240 

TCCTCAGCAT CTTATCCGAG TGGAAGGAAA TTTGCGTGTG GAGTATTTGG ATGACAGAAA 300 

CACTTTTCGA CATAGTGTGG TGGTGCCCTA TGAGCCGCCT GAGGTTGGCT CTGACTGTAC 360 

CACCATCCAC TACAACTACA TGTGTAACAG TTCCTGCATG GGCGGCATGA ACCGGAGTCC 420

CATCCTC 427 
(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 196 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 
CATGAACCGG AGTCCCATCC TCACCATCAT CACACTGGAA GACTCCAGTG GTAATCTACT 60 
GGGACGGAAC AGCTTTGAGG TGCGTGTTTG TGCCTGTCCT GGGAGAGACC GGCGCACAGA 120 
GGAAGAGAAT CTCCGCAAGA AAGGGGAGCC TCACCACGAG CTGCCCCCAG GGAGCACTAA 180 
GCGAGCACTG CCCAAC 196 
(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 498 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 



TCTGGGCTTC TTGCATTCTG GGACAGCCAA GTCTGTGACT TGCACGTACT CCCCTGCCCT 60

CAACAAGATG TTTTGCCAAC TGGCCAAGAC CTGCCCTGTG CAGCTGTGGG TTGATTCCAC 120

ACCCCCGCCC GGCACCCGCG TCCGCGCCAT GGCCATCTAC AAGCAGTCAC AGCACATGAC 180

GGAGGTTGTG AGGCGCTGCC CCCACCATGA GCGCTGCTCA GATAGCGATG GTCTGGCCCC 240

TCCTCAGCAT CTTATCCGAG TGGAAGGAAA TTTGCGTGTG GAGTATTTGG ATGACAGAAA 300

CACTTTTCGA CATAGTGTGG TGGTGCCCTA TGAGCCGCCT GAGGTTGGCT CTGACTGTAC 360

CACCATCCAC TACAACTACA TGTGTAACAG TTCCTGCATG GGCGGCATGA ACCGGAGGCC 420

CATCCTCACC ATCATCACAC TGGAAGACTC CAGTGGTAAT CTACTGGGAC GGAACAGCTT 480

TGAGGTGCAT GTTTGTGC 498



(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

CAGCTTTGAG GTGCATGTTT GTGCCTGTCC TGGGAGAGAC CGGCGCACAG AGGAAGAGAA 60 

TCTCCGCAAG AAAGGGGAGC CTCACCACGA GCTGCCCCCA GGGAGCACTA AGCGAGCACT 120

GCCCAAC 127 
(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
GGTTTTTCTT TGAGGTTTAG 20 
(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic)



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 



GCGACACTCC ACCATAGAT 19



(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 
CTGTCTTCAC GCAGAAAGC 19 
(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 
GCACGGTCTA CGAGACCTC 19 
(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 
GATCTACTAG TCATATGGAT 20

(2) INFORMATION FOR SEQ ID NO:120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120:
TCGGTACCCG GGGATCCGAT 20
(2) INFORMATION FOR SEQ ID NO: 121:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 
CTGTCTTCAC GCAGAAAGCG TCTGGCCATG GCGTTAGTAT GAGTGTCGTG CAGCCTCCAG 60 
GACCCCCCCT CCCGGGAGAG CCATAGTGGT CTGCGGAACC GGTGAGTACA CCGGAATTGC 120
CAGGACGACC GGGTCCTTTC TTGGATAAAC CCGCTCAATG CCTGGAGATT TGGGCGTGCC 180
CCCGCAAGAC TGCTAGCCGA GTAGTGTTGG GTCGCGAAAG GCCTTGTGGT ACTGCCTGAT 240
AGGGTGCCTG CGAGTGCCCC GGGAGGTCTC GTAGACCGTG C 281

(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 386 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 



CTGTCTTCAC GCAGAAAGCG TCTGGCCATG GCGTTAGTAT GAGTGTCGTG CAGCCTCCAG 60

GACCCCCCCT CCCGGGAGAG CCATAGTGGT CTGCGGAACC GGTGACTGTC TTCACGCAGA 120

AAGCGTCTAG CCATGGCGTT AGTATGAGTG TCGTGCAGCC TCCAGGACCC CCCCTCCCGG 180

GAGAGCCATA GTGGTCTGCG GAACCGGTGA GTACACCGGA ATTGCCAGGA CGACCGGGTC 240

CTTTCTTGGA TCAACCCGCT CAATGCCTGG AGATTTGGGC GTGCCCCCGC AAGACTGCTA 300

GCCGAGTAGT GTTGGGTCGC GAAAGGCCTT GTGGTACTGC CTGATAGGGT GCTTGCGAGT 360

GCCCCGGGAG GTCTCGTAGA CCGTGC 386



(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:123:



CTGTCTTCAC GCAGAAAGCG TCTAGCCATG GCGTTAGTAT GAGTGTCGTG CAGCCTCCAG 60

GTCCCCCCCT CCCGGGAGAG CCATAGTGGT CTGCGGAACC GGTGAGTACA CCGGAATTGC 120

CAGGACGACC GGGTCCTTTC TTGGATCAAC CCGCTCAATG CCTGGAGATT TGGGCGTGCC 180

CCCGCGAGAC TGCTAGCCGA GTAGTGTTGG GTCGCGAAAG GCCTTGTGGT ACTGCCTGAT 240

AGGGTGCTTG CGAGTGCCCC GGGAGGTCTC GTAGACCGTG C 281



(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 282 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 
CTGTCTTCAC GCAGAAAGCG TCTAGCCATG GCGTTAGTAT GAGTGTCGTG CAGCCTCCAG 60 
GACCCCCCCT CCCGGGAGAG CCATAGTGGT CTGCGGAACC GGTGAGTACA CCGGAATTGC 120
CAGGACGACC GGGTCCTTTC GTGGATGTAA CCCGCTCAAT GCCTGGAGAT TTGGGCGTGC 180
CCCCGCAAGA CTGCTAGCCG AGTAGTGTTG GGTCGCGAAA GGCCTTGTGG TACTGCCTGA 240
TAGGGTGCTT GCGAGTGCCC CGGGAGGTCT CGTAGACCGT GC 282

(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:125: 

CTGTCTTCAC GCAGAAAGCG TCTAGCCATG GCGTTAGTAT GAGTGTCGTA CAGCCTCCAG 60 

GCCCCCCCCT CCCGGGAGAG CCATAGTGGT CTGCGGAACC GGTGAGTACA CCGGAATTGC 120 

CGGGAAGACT GGGTCCTTTC TTGGATAAAC CCACTCTATG CCCGGCCATT TGGGCGTGCC 180 

CCCGCAAGAC TGCTAGCCGA GTAGCGTTGG GTTGCGAAAG GCCTTGTGGT ACTGCCTGAT 240 

AGGGTGCTTG CGAGTACCCC GGGAGGTCTC GTAGACCGTG C 281
(2) INFORMATION FOR SEQ ID NO: 126:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:126: 



CTGTCTTCAC GCAGAAAGCG CCTAGCCATG GCGTTAGTAC GAGTGTCGTG CAGCCTCCAG 60

GACCCCCCCT CCCGGGAGAG CCATAGTGGT CTGCGGAACC GGTGAGTACA CCGGAATCGC 120

TGGGGTGACC GGGTCCTTTC TTGGAGCAAC CCGCTCAATA CCCAGAAATT TGGGCGTGCC 180

CCCGCGAGAT CACTAGCCGA GTAGTGTTGG GTCGCGAAAG GCCTTGTGGT ACTGCCTGAT 240

AGGGTGCTTG CGAGTGCCCC GGGAGGTCTC GTAGACCGTG C 281



(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:127: 
GCACGGTCTA CGAGACCTCC CGGGGCACTC GCAGGCACCC TATCAGGCAG TACCACAAGG 60 
CCTTTCGCGA CCCAACACTA CTCGGCTAGC AGTCTTGCGG GGGCACGCCC AAATCTCCAG 120
GCATTGAGCG GGTTTATCCA AGAAAGGACC CGGTCGTCCT GGCAATTCCG GTGTACTCAC 180
CGGTTCCGCA GACCACTATG GCTCTCCCGG GAGGGGGGGT CCTGGAGGCT GCACGACACT 240
CATACTAACG CCATGGCCAG ACGCTTTCTG CGTGAAGACA G 281

(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:128: 
GCACGGTCTA CGAGACCTCC CGGGGCACTC GCAAGCACCC TATCAGGCAG TACCACAAGG 60 
CCTTTCGCGA CCCAACACTA CTCGGCTAGC AGTCTTGCGG GGGCACGCCC AAATCTCCAG 120
GCATTGAGCG GGTTGATCCA AGAAAGGACC CGGTCGTCCT GGCAATTCCG GTGTACTCAC 180
CGGTTCCGCA GACCACTATG GCTCTCCCGG GAGGGGGGGT CCTGGAGGCT GCACGACACT 240
CATACTAACG CCATGGCTAG ACGCTTTCTG CGTGAAGACA G 281



(2) INFORMATION FOR SEQ ID NO: 129:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129:
GCACGGTCTA CGAGACCTCC CGGGGCACTC GCAAGCACCC TATCAGGCAG TACCACAAGG 60 
CCTTTCGCGA CCCAACACTA CTCGGCTAGC AGTCTCGCGG GGGCACGCCC AAATCTCCAG 120 
GCATTGAGCG GGTTGATCCA AGAAAGGACC CGGTCGTCCT GGCAATTCCG GTGTACTCAC 180 
CGGTTCCGCA GACCACTATG GCTCTCCCGG GAGGGGGGGA CCTGGAGGCT GCACGACACT 240 
CATACTAACG CCATGGCTAG ACGCTTTCTG CGTGAAGACA G 281 
(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 282 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 
GCACGGTCTA CGAGACCTCC CGGGGCACTC GCAAGCACCC TATCAGGCAG TACCACAAGG 60 
CCTTTCGCGA CCCAACACTA CTCGGCTAGC AGTCTTGCGG GGGCACGCCC AAATCTCCAG 120 
GCATTGAGCG GGTTACATCC ACGAAAGGAC CCGGTCGTCC TGGCAATTCC GGTGTACTCA 180 
CCGGTTCCGC AGACCACTAT GGCTCTCCCG GGAGGGGGGG TCCTGGAGGC TGCACGACAC 240 
TCATACTAAC GCCATGGCTA GACGCTTTCT GCGTGAAGAC AG 282 
(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131:



GCACGGTCTA CGAGACCTCC CGGGGTACTC GCAAGCACCC TATCAGGCAG TACCACAAGG 60

CCTTTCGCAA CCCAACGCTA CTCGGCTAGC AGTCTTGCGG GGGCACGCCC AAATGGCCGG 120

GCATAGAGTG GGTTTATCCA AGAAAGGACC CAGTCTTCCC GGCAATTCCG GTGTACTCAC 180

CGGTTCCGCA GACCACTATG GCTCTCCCGG GAGGGGGGGG CCTGGAGGCT GTACGACACT 240

CATACTAACG CCATGGCTAG ACGCTTTCTG CGTGAAGACA G 281



(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:132: 
GCACGGTCTA CGAGACCTCC CGGGGCACTC GCAAGCACCC TATCAGGCAG TACCACAAGG 60 
CCTTTCGCGA CCCAACACTA CTCGGCTAGT GATCTCGCGG GGGCACGCCC AAATTTCTGG 120 
GTATTGAGCG GGTTGCTCCA AGAAAGGACC CGGTCACCCC AGCGATTCCG GTGTACTCAC 180 
CGGTTCCGCA GACCACTATG GCTCTCCCGG GAGGGGGGGT CCTGGAGGCT GCACGACACT 240 
CGTACTAACG CCATGGCTAG GCGCTTTCTG CGTGAAGACA G 281 
(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 
ATCAACATCC GGCCGGTGGT 20

(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 
GGGGCCTCGC TACGGACCAG 20

(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135:

ATCAACATCC GGCCGGTGGT CGCCGCGATC AAGGAGTTCT TCGGCACCAG CCAGCTGAGC 60

CAATTCATGG ACCAGAACAA CCCGCTGTCG GGGTTGACCC ACAAGCGCCG ACTGTCGGCG 120

CTGGGGCCCG GCGGTCTGTC ACGTGAGCGT GCCGGGCTGG AGGTCCGCGA CGTGCACCCG 180

TCGCACTACG GCCGGATGTG CCCGATCGAA ACCCCTGAGG GGCCCAACAT CGGTCTGATC 240

GGCTCGCTGT CGGTGTACGC GCGGGTCAAC CCGTTCGGGT TCATCGAAAC GCCGTACCGC 300

AAGGTGGTCG ACGGCGTGGT TAGCGACGAG ATCGTGTACC TGACCGCCGA CGAGGAGGAC 360

CGCCACGTGG TGGCACAGGC CAATTCGCCG ATCGATGCGG ACGGTCGCTT CGTCGAGCCG 420

CGCGTGCTGG TCCGCCGCAA GGCGGGCGAG GTGGAGTACG TGCCCTCGTC TGAGGTGGAC 480

TACATGGACG TCTCGCCCCG CCAGATGGTG TCGGTGGCCA CCGCGATGAT TCCCTTCCTG 540

GAGCACGACG ACGCCAACCG TGCCCTCATG GGGGCAAACA TGCAGCGCCA GGCGGTGCCG 600

CTGGTCCGTA GCGAGGCCCC 620
(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

ATCAACATCC GGCCGGTGGT CGCCGCGATC AAGGAGTTCT TCGGCACCAG CCAGCTGAGC 60 

CAATTCATGG ACCAGAACAA CCCGCTGTCG GGGTTGACCT ACAAGCGCCG ACTGTCGGCG 120

CTGGGGCCCG GCGGTCTGTC ACGTGAGCGT GCCGGGCTGG AGGTCCGCGA CGTGCACCCG 180 

TCGCACTACG GCCGGATGTG CCCGATCGAA ACCCCTGAGG GGCCCAACAT CGGTCTGATC 240 
GGCTCGCTGT CGGTGTACGC GCGGGTCAAC CCGTTCGGGT TCATCGAAAC GCCGTACCGC 300

AAGGTGGTCG ACGGCGTGGT TAGCGACGAG ATCGTGTACC TGACCGCCGA CGAGGAGGAC 360 

CGCCACGTGG TGGCACAGGC CAATTCGCCG ATCGATGCGG ACGGTCGCTT CGTCGAGCCG 420 

CGCGTGCTGG TCCGCCGCAA GGCGGGCGAG GTGGAGTACG TGCCCTCGTC TGAGGTGGAC 480

TACATGGACG TCTCGCCCCG CCAGATGGTG TCGGTGGCCA CCGCGATGAT TCCCTTCCTG 540

GAGCACGACG ACGCCAACCG TGCCCTCATG GGGGCAAACA TGCAGCGCCA GGCGGTGCCG 600 

CTGGTCCGTA GCGAGGCCCC 620
(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137:



ATCAACATCC GGCCGGTGGT CGCCGCGATC AAGGAGTTCT TCGGCACCAG CCAGCTGAGC 60

CAATTCATGG ACCAGAACAA CCCGCTGTCG GGGTTGACCC ACAAGCGCCG ACTGTTGGCG 120

CTGGGGCCCG GCGGTCTGTC ACGTGAGCGT GCCGGGCTGG AGGTCCGCGA CGTGCACCCG 180

TCGCACTACG GCCGGATGTG CCCGATCGAA ACCCCTGAGG GGCCCAACAT CGGTCTGATC 240

GGCTCGCTGT CGGTGTACGC GCGGGTCAAC CCGTTCGGGT TCATCGAAAC GCCGTACCGC 300

AAGGTGGTCG ACGGCGTGGT TAGCGACGAG ATCGTGTACC TGACCGCCGA CGAGGAGGAC 360

CGCCACGTGG TGGCACAGGC CAATTCGCCG ATCGATGCGG ACGGTCGCTT CGTCGAGCCG 420

CGCGTGCTGG TCCGCCGCAA GGCGGGCGAG GTGGAGTACG TGCCCTCGTC TGAGGTGGAC 480

TACATGGACG TCTCGCCCCG CCAGATGGTG TCGGTGGCCA CCGCGATGAT TCCCTTCCTG 540

GAGCACGACG ACGCCAACCG TGCCCTCATG GGGGCAAACA TGCAGCGCCA GGCGGTGCCG 600

CTGGTCCGTA GCGAGGCCCC 620



(2) INFORMATION FOR SEQ ID NO:138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138:



GGGGCCTCGC TACGGACCAG CGGCACCGCC TGGCGCTGCA TGTTTGCCCC CATGAGGGCA 60

CGGTTGGCGT CGTCGTGCTC CAGGAAGGGA ATCATCGCGG TGGCCACCGA CACCATCTGG 120

CGGGGCGAGA CGTCCATGTA GTCCACCTCA GACGAGGGCA CGTACTCCAC CTCGCCCGCC 180

TTGCGGCGGA CCAGCACGCG CGGCTCGACG AAGCGACCGT CCGCATCGAT CGGCGAATTG 240

GCCTGTGCCA CCACGTGGCG GTCCTCCTCG TCGGCGGTCA GGTACACGAT CTCGTCGCTA 300

ACCACGCCGT CGACCACCTT GCGGTACGGC GTTTCGATGA ACCCGAACGG GTTGACCCGC 360

GCGTACACCG ACAGCGAGCC GATCAGACCG ATGTTGGGCC CCTCAGGGGT TTCGATCGGG 420

CACATCCGGC CGTAGTGCGA CGGGTGCACG TCGCGGACCT CCAGCCCGGC ACGCTCACGT 480

GACAGACCGC CGGGCCCCAG CGCCGACAGT CGGCGCTTGT GGGTCAACCC CGACAGCGGG 540

TTGTTCTGGT CCATGAATTG GCTCAGCTGG CTGGTGCCGA AGAACTCCTT GATCGCGGCG 600

ACCACCGGCC GGATGTTGAT 620



(2) INFORMATION FOR SEQ ID NO: 139:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139:



GGGGCCTCGC TACGGACCAG CGGCACCGCC TGGCGCTGCA TGTTTGCCCC CATGAGGGCA 60

CGGTTGGCGT CGTCGTGCTC CAGGAAGGGA ATCATCGCGG TGGCCACCGA CACCATCTGG 120

CGGGGCGAGA CGTCCATGTA GTCCACCTCA GACGAGGGCA CGTACTCCAC CTCGCCCGCC 180

TTGCGGCGGA CCAGCACGCG CGGCTCGACG AAGCGACCGT CCGCATCGAT CGGCGAATTG 240

GCCTGTGCCA CCACGTGGCG GTCCTCCTCG TCGGCGGTCA GGTACACGAT CTCGTCGCTA 300

ACCACGCCGT CGACCACCTT GCGGTACGGC GTTTCGATGA ACCCGAACGG GTTGACCCGC 360

GCGTACACCG ACAGCGAGCC GATCAGACCG ATGTTGGGCC CCTCAGGGGT TTCGATCGGG 420

CACATCCGGC CGTAGTGCGA CGGGTGCACG TCGCGGACCT CCAGCCCGGC ACGCTCACGT 480

GACAGACCGC CGGGCCCCAG CGCCGACAGT CGGCGCTTGT AGGTCAACCC CGACAGCGGG 540

TTGTTCTGGT CCATGAATTG GCTCAGCTGG CTGGTGCCGA AGAACTCCTT GATCGCGGCG 600

ACCACCGGCC GGATGTTGAT 620
(2) INFORMATION FOR SEQ ID NO: 140:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:140: 

GGGGCCTCGC TACGGACCAG CGGCACCGCC TGGCGCTGCA TGTTTGCCCC CATGAGGGCA 60

CGGTTGGCGT CGTCGTGCTC CAGGAAGGGA ATCATCGCGG TGGCCACCGA CACCATCTGG 120

CGGGGCGAGA CGTCCATGTA GTCCACCTCA GACGAGGGCA CGTACTCCAC CTCGCCCGCC 180

TTGCGGCGGA CCAGCACGCG CGGCTCGACG AAGCGACCGT CCGCATCGAT CGGCGAATTG 240

GCCTGTGCCA CCACGTGGCG GTCCTCCTCG TCGGCGGTCA GGTACACGAT CTCGTCGCTA 300

ACCACGCCGT CGACCACCTT GCGGTACGGC GTTTCGATGA ACCCGAACGG GTTGACCCGC 360

GCGTACACCG ACAGCGAGCC GATCAGACCG ATGTTGGGCC CCTCAGGGGT TTCGATCGGG 420

CACATCCGGC CGTAGTGCGA CGGGTGCACG TCGCGGACCT CCAGCCCGGC ACGCTCACGT 480

GACAGACCGC CGGGCCCCAG CGCCAACAGT CGGCGCTTGT GGGTCAACCC CGACAGCGGG 540

TTGTTCTGGT CCATGAATTG GCTCAGCTGG CTGGTGCCGA AGAACTCCTT GATCGCGGCG 600

ACCACCGGCC GGATGTTGAT 620
(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 
AGCTCGTATG GCACCGGAAC 20

(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:142: 
TTGACCTCCC ACCCGACTTG 20
(2) INFORMATION FOR SEQ ID NO: 143:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:143: 



AGCTCGTATG GCACCGGAAC CGGTAAGGAC GCGATCACCA GCGGCATCGA GGTCGTATGG 60

ACGAACACCC CGACGAAATG GGACAACAGT TTCCTCGAGA TCCTGTACGG CTACGAGTGG 120

GAGCTGACGA AGAGCCCTGC TGGCGCTTGG CAATACACCG CCAAGGACGG CGCCGGTGCC 180

GGCACCATCC CGGACCCGTT CGGCGGGCCA GGGCGCTCCC CGACGATGCT GGCCACTGAC 240

CTCTCGCTGC GGGTGGATCC GATCTATGAG CGGATCACGC GTCGCTGGCT GGAACACCCC 300

GAGGAATTGG CCGACGAGTT CGCCAAGGCC TGGTACAAGC TGATCCACCG AGACATGGGT 360

CCCGTTGCGA GATACCTTGG GCCGCTGGTC CCCAAGCAGA CCCTGCTGTG GCAGGATCCG 420

GTCCCTGCGG TCAGCCACGA CCTCGTCGGC GAAGCCGAGA TTGCCAGCCT TAAGAGCCAG 480

ATCCGGGCAT CGGGATTGAC TGTCTCACAG CTAGTTTCGA CCGCATGGGC GGCGGCGTCG 540

TCGTTCCGTG GTAGCGACAA GCGCGGCGGC GCCAACGGTG GTCGCATCCG CCTGCAGCCA 600

CAAGTCGGGT GGGAGGTCAA 620
(2) INFORMATION FOR SEQ ID NO: 144:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:144: 

AGCTCGTATG GCACCGGAAC CGGTAAGGAC GCGATCACCA CCGGCATCGA GGTCGTATGG 60 

ACGAACACCC CGACGAAATG GGACAACAGT TTCCTCGAGA TCCTGTACGG CTACGAGTGG 120 

GAGCTGACGA AGAGCCCTGC TGGCGCTTGG CAATACACCG CCAAGGACGG CGCCGGTGCC 180 

GGCACCATCC CGGACCCGTT CGGCGGGCCA GGGCGCTCCC CGACGATGCT GGCCACTGAC 240 

CTCTCGCTGC GGGTGGATCC GATCTATGAG CGGATCACGC GTCGCTGGCT GGAACACCCC 300 

GAGGAATTGG CCGACGAGTT CGCCAAGGCC TGGTACAAGC TGATCCACCG AGACATGGGT 360 

CCCGTTGCGA GATACCTTGG GCCGCTGGTC CCCAAGCAGA CCCTGCTGTG GCAGGATCCG 420 

GTCCCTGCGG TCAGCCACGA CCTCGTCGGC GAAGCCGAGA TTGCCAGCCT TAAGAGCCAG 480

ATCCGGGCAT CGGGATTGAC TGTCTCACAG CTAGTTTCGA CCGCATGGGC GGCGGCGTCG 540

TCGTTCCGTG GTAGCGACAA GCGCGGCGGC GCCAACGGTG GTCGCATCCG CCTGCAGCCA 600 

CAAGTCGGGT GGGAGGTCAA 620 
(2) INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 



AGCTCGTATG GCACCGGAAC CGGTAAGGAC GCGATCACCA GCGGCATCGA GGTCGTATGG 60

ACGAACACCC CGACGAAATG GGACAACAGT TTCCTCGAGA TCCTGTACGG CTACGAGTGG 120

GAGCTGACGA AGAGCCCTGC TGGCGCTTGG CAATACACCG CCAAGGACGG CGCCGGTGCC 180

GGCACCATCC CGGACCCGTT CGGCGGGCCA GGGCGCTCCC CGACGATGCT GGCCACTGAC 240

CTCTCGCTGC GGGTGGATCC GATCTATGAG CGGATCACGC GTCGCTGGCT GGAACACCCC 300

GAGGAATTGG CCGACGAGTT CGCCAAGGCC TGGTACAAGC TGATCCACCG AGACATGGGT 360

CCCGTTGCGA GATACCTTGG GCCGCTGGTC CCCAAGCAGA CCCTGCTGTG GCAGGATCCG 420

GTCCCTGCGG TCAGCCACGA CCTCGTCGGC GAAGCCGAGA TTGCCAGCCT TAAGAGCCAG 480

ATCCTGGCAT CGGGATTGAC TGTCTCACAG CTAGTTTCGA CCGCATGGGC GGCGGCGTCG 540

TCGTTCCGTG GTAGCGACAA GCGCGGCGGC GCCAACGGTG GTCGCATCCG CCTGCAGCCA 600

CAAGTCGGGT GGGAGGTCAA 620
(2) INFORMATION FOR SEQ ID NO: 146:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146:

AGCTCGTATG GCACCGGAAC CGGTAAGGAC GCGATCACCA CCGGCATCGA GGTCGTATGG 60

ACGAACACCC CGACGAAATG GGACAACAGT TTCCTCGAGA TCCTGTACGG CTACGAGTGG 120

GAGCTGACGA AGAGCCCTGC TGGCGCTTGG CAATACACCG CCAAGGACGG CGCCGGTGCC 180

GGCACCATCC CGGACCCGTT CGGCGGGCCA GGGCGCTCCC CGACGATGCT GGCCACTGAC 240

CTCTCGCTGC GGGTGGATCC GATCTATGAG CGGATCACGC GTCGCTGGCT GGAACACCCC 300

GAGGAATTGG CCGACGAGTT CGCCAAGGCC TGGTACAAGC TGATCCACCG AGACATGGGT 360

CCCGTTGCGA GATACCTTGG GCCGCTGGTC CCCAAGCAGA CCCTGCTGTG GCAGGATCCG 420

GTCCCTGCGG TCAGCCACGA CCTCGTCGGC GAAGCCGAGA TTGCCAGCCT TAAGAGCCAG 480

ATCCTGGCAT CGGGATTGAC TGTCTCACAG CTAGTTTCGA CCGCATGGGC GGCGGCGTCG 540

TCGTTCCGTG GTAGCGACAA GCGCGGCGGC GCCAACGGTG GTCGCATCCG CCTGCAGCCA 600

CAAGTCGGGT GGGAGGTCAA 620
(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

TTGACCTCCC ACCCGACTTG TGGCTGCAGG CGGATGCGAC CACCGTTGGC GCCGCCGCGC 60

TTGTCGCTAC CACGGAACGA CGACGCCGCC GCCCATGCGG TCGAAACTAG CTGTGAGACA 120

GTCAATCCCG ATGCCCGGAT CTGGCTCTTA AGGCTGGCAA TCTCGGCTTC GCCGACGAGG 180

TCGTGGCTGA CCGCAGGGAC CGGATCCTGC CACAGCAGGG TCTGCTTGGG GACCAGCGGC 240

CCAAGGTATC TCGCAACGGG ACCCATGTCT CGGTGGATCA GCTTGTACCA GGCCTTGGCG 300

AACTCGTCGG CCAATTCCTC GGGGTGTTCC AGCCAGCGAC GCGTGATCCG CTCATAGATC 360

GGATCCACCC GCAGCGAGAG GTCAGTGGCC AGCATCGTCG GGGAGCGCCC TGGCCCGCCG 420

AACGGGTCCG GGATGGTGCC GGCACCGGCG CCGTCCTTGG CGGTGTATTG CCAAGCGCCA 480

GCAGGGCTCT TCGTCAGCTC CCACTCGTAG CCGTACAGGA TCTCGAGGAA ACTGTTGTCC 540

CATTTCGTCG GGGTGTTCGT CCATACGACC TCGATGCCGC TGGTGATCGC GTCCTTACCG 600

GTTCCGGTGC CATACGAGCT 620
(2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 
TTGACCTCCC ACCCGACTTG TGGCTGCAGG CGGATGCGAC CACCGTTGGC GCCGCCGCGC 60 
TTGTCGCTAC CACGGAACGA CGACGCCGCC GCCCATGCGG TCGAAACTAG CTGTGAGACA 120 



GTCAATCCCG ATGCCCGGAT CTGGCTCTTA AGGCTGGCAA TCTCGGCTTC GCCGACGAGG 180 

TCGTGGCTGA CCGCAGGGAC CGGATCCTGC CACAGCAGGG TCTGCTTGGG GACCAGCGGC 240 

CCAAGGTATC TCGCAACGGG ACCCATGTCT CGGTGGATCA GCTTGTACCA GGCCTTGGCG 300 

AACTCGTCGG CCAATTCCTC GGGGTGTTCC AGCCAGCGAC GCGTGATCCG CTCATAGATC 360 

GGATCCACCC GCAGCGAGAG GTCAGTGGCC AGCATCGTCG GGGAGCGCCC TGGCCCGCCG 420 

AACGGGTCCG GGATGGTGCC GGCACCGGCG CCGTCCTTGG CGGTGTATTG CCAAGCGCCA 480 

GCAGGGCTCT TCGTCAGCTC CCACTCGTAG CCGTACAGGA TCTCGAGGAA ACTGTTGTCC 540 

CATTTCGTCG GGGTGTTCGT CCATACGACC TCGATGCCGG TGGTGATCGC GTCCTTACCG 600 

GTTCCGGTGC CATACGAGCT 620 
(2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149:



TTGACCTCCC ACCCGACTTG TGGCTGCAGG CGGATGCGAC CACCGTTGGC GCCGCCGCGC 60

TTGTCGCTAC CACGGAACGA CGACGCCGCC GCCCATGCGG TCGAAACTAG CTGTGAGACA 120

GTCAATCCCG ATGCCAGGAT CTGGCTCTTA AGGCTGGCAA TCTCGGCTTC GCCGACGAGG 180

TCGTGGCTGA CCGCAGGGAC CGGATCCTGC CACAGCAGGG TCTGCTTGGG GACCAGCGGC 240

CCAAGGTATC TCGCAACGGG ACCCATGTCT CGGTGGATCA GCTTGTACCA GGCCTTGGCG 300

AACTCGTCGG CCAATTCCTC GGGGTGTTCC AGCCAGCGAC GCGTGATCCG CTCATAGATC 360

GGATCCACCC GCAGCGAGAG GTCAGTGGCC AGCATCGTCG GGGAGCGCCC TGGCCCGCCG 420

AACGGGTCCG GGATGGTGCC GGCACCGGCG CCGTCCTTGG CGGTGTATTG CCAAGCGCCA 480

GCAGGGCTCT TCGTCAGCTC CCACTCGTAG CCGTACAGGA TCTCGAGGAA ACTGTTGTCC 540

CATTTCGTCG GGGTGTTCGT CCATACGACC TCGATGCCGC TGGTGATCGC GTCCTTACCG 600

GTTCCGGTGC CATACGAGCT 620
(2) INFORMATION FOR SEQ ID NO: 150:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:150:



TTGACCTCCC ACCCGACTTG TGGCTGCAGG CGGATGCGAC CACCGTTGGC GCCGCCGCGC 60

TTGTCGCTAC CACGGAACGA CGACGCCGCC GCCCATGCGG TCGAAACTAG CTGTGAGACA 120

GTCAATCCCG ATGCCAGGAT CTGGCTCTTA AGGCTGGCAA TCTCGGCTTC GCCGACGAGG 180

TCGTGGCTGA CCGCAGGGAC CGGATCCTGC CACAGCAGGG TCTGCTTGGG GACCAGCGGC 240

CCAAGGTATC TCGCAACGGG ACCCATGTCT CGGTGGATCA GCTTGTACCA GGCCTTGGCG 300

AACTCGTCGG CCAATTCCTC GGGGTGTTCC AGCCAGCGAC GCGTGATCCG CTCATAGATC 360

GGATCCACCC GCAGCGAGAG GTCAGTGGCC AGCATCGTCG GGGAGCGCCC TGGCCCGCCG 420

AACGGGTCCG GGATGGTGCC GGCACCGGCG CCGTCCTTGG CGGTGTATTG CCAAGCGCCA 480

GCAGGGCTCT TCGTCAGCTC CCACTCGTAG CCGTACAGGA TCTCGAGGAA ACTGTTGTCC 540

CATTTCGTCG GGGTGTTCGT CCATACGACC TCGATGCCGG TGGTGATCGC GTCCTTACCG 600

GTTCCGGTGC CATACGAGCT 620
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(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 
AGAGTTTGAT CCTGGCTCAG 20

(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 
GGCGGACGGG TGAGTAA 17 
(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153:

CTGCTGCCTC CCGTAGGAGT 20

(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154:
ATGACGTCAA GTCATCATGG CCCTTACGA 29
(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155:
GTACAAGGCC CGGGAACGTA TTCACCG 27
(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156:
GCAACGAGCG CAACCC 16

(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157:
ATGACGTCAA GTCATCATGG CCCTTA 26
(2) INFORMATION FOR SEQ ID NO: 158:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1542 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158:

AAATTGAAGA GTTTGATCAT GGCTCAGATT GAACGCTGGC GGCAGGCCTA ACACATGCAA 60

GTCGAACGGT AACAGGAAGA AGCTTGCTTC TTTGCTGACG AGTGGCGGAC GGGTGAGTAA 120

TGTCTGGGAA ACTGCCTGAT GGAGGGGGAT AACTACTGGA AACGGTAGCT AATACCGCAT 180

AACGTCGCAA GACCAAAGAG GGGGACCTTC GGGCCTCTTG CCATCGGATG TGCCCAGATG 240

GGATTAGCTA GTAGGTGGGG TAACGGCTCA CCTAGGCGAC GATCCCTAGC TGGTCTGAGA 300

GGATGACCAG CCACACTGGA ACTGAGACAC GGTCCAGACT CCTACGGGAG GCAGCAGTGG 360

GGAATATTGC ACAATGGGCG CAAGCCTGAT GCAGCCATGC CGCGTGTATG AAGAAGGCCT 420

TCGGGTTGTA AAGTACTTTC AGCGGGGAGG AAGGGAGTAA AGTTAATACC TTTGCTCATT 480

GACGTTACCC GCAGAAGAAG CACCGGCTAA CTCCGTGCCA GCAGCCGCGG TAATACGGAG 540

GGTGCAAGCG TTAATCGGAA TTACTGGGCG TAAAGCGCAC GCAGGCGGTT TGTTAAGTCA 600

GATGTGAAAT CCCCGGGCTC AACCTGGGAA CTGCATCTGA TACTGGCAAG CTTGAGTCTC 660

GTAGAGGGGG GTAGAATTCC AGGTGTAGCG GTGAAATGCG TAGAGATCTG GAGGAATACC 720

GGTGGCGAAG GCGGCCCCCT GGACGAAGAC TGACGCTCAG GTGCGAAAGC GTGGGGAGCA 780

AACAGGATTA GATACCCTGG TAGTCCACGC CGTAAACGAT GTCGACTTGG AGGTTGTGCC 840

CTTGAGGCGT GGCTTCCGGA GCTAACGCGT TAAGTCGACC GCCTGGGGAG TACGGCCGCA 900

AGGTTAAAAC TCAAATGAAT TGACGGGGGC CCGCACAAGC GGTGGAGCAT GTGGTTTAAT 960

TCGATGCAAC GCGAAGAACC TTACCTGGTC TTGACATCCA CGGAAGTTTT CAGAGATGAG 1020

AATGTGCCTT CGGGAACCGT GAGACAGGTG CTGCATGGCT GTCGTCAGCT CGTGTTGTGA 1080

AATGTTGGGT TAAGTCCCGC AACGAGCGCA ACCCTTATCC TTTGTTGCCA GCGGTCCGGC 1140

CGGGAACTCA AAGGAGACTG CCAGTGATAA ACTGGAGGAA GGTGGGGATG ACGTCAAGTC 1200

ATCATGGCCC TTACGACCAG GGCTACACAC GTGCTACAAT GGCGCATACA AAGAGAAGCG 1260

ACCTCGCGAG AGCAAGCGGA CCTCATAAAG TGCGTCGTAG TCCGGATTGG AGTCTGCAAC 1320

TCGACTCCAT GAAGTCGGAA TCGCTAGTAA TCGTGGATCA GAATGCCACG GTGAATACGT 1380

TCCCGGGCCT TGTACACACC GCCCGTCACA CCATGGGAGT GGGTTGCAAA AGAAGTAGGT 1440

AGCTTAACCT TCGGGAGGGC GCTTACCACT TTGTGATTCA TGACTGGGGT GAAGTCGTAA 1500 

CAAGGTAACC GTAGGGGAAC CTGCGGTTGG ATCACCTCCT TA 1542 
(2) INFORMATION FOR SEQ ID NO:159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1513 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 



TTTTTATGGA GAGTTTGATC CTGGCTCAGA GTGAACGCTG GCGGCGTGCC TAATACATGC 60

AAGTCGAACG ATGAAGCTTC TAGCTTGCTA GAAGTGGATT AGTGGCGCAC GGGTGAGTAA 120

GGTATAGTTA ATCTGCCCTA CACAAGAGGA CAACAGTTGG AAACGACTGC TAATACTCTA 180

TACTCCTGCT TAACACAAGT TGAGTAGGGA AAGTTTTTCG GTGTAGGATG AGACTATATA 240

GTATCAGCTA GTTGGTAAGG TAATGGCTTA CCAAGGCTAT GACGCTTAAC TGGTCTGAGA 300

GGATGATCAG TCACACTGGA ACTGAGACAC GGTCCAGACT CCTACGGGAG GCAGCAGTAG 360

GGAATATTGC GCAATGGGGG AAACCCTGAC GCAGCAACGC CGCGTGGAGG ATGACACTTT 420

TCGGAGCGTA AACTCCTTTT CTTAGGGAAG AATTCTGACG GTACCTAAGG AATAAGCACC 480

GGCTAACTCC GTGCCAGCAG CCGCGGTAAT ACGGAGGGTG CAAGCGTTAC TCGGAATCAC 540

TGGGCGTAAA GGGCGCGTAG GCGGATTATC AAGTCTCTTG TGAAATCTAA TGGCTTAACC 600

ATTAAACTGC TTGGGAAACT GATAGTCTAG AGTGAGGGAG AGGCAGATGG AATTGGTGGT 660

GTAGGGGTAA AATCCGTAGA TATCACCAAG AATACCCATT GCGAAGGCGA TCTGCTGGAA 720

CTCAACTGAC GCTAAGGCGC GAAAGCGTGG GGAGCAAACA GGATTAGATA CCCTGGTAGT 780

CCACGCCCTA AACGATGTAC ACTAGTTGTT GGGGTGCTAG TCATCTCAGT AATGCAGCTA 840

ACGCATTAAG TGTACCGCCT GGGGAGTACG GTCGCAAGAT TAAAACTCAA AGGAATAGAC 900

GGGGACCCGC ACAAGCGGTG GAGCATGTGG TTTAATTCGA AGATACGCGA AGAACCTTAC 960

CTGGGCTTGA TATCCTAAGA ACCTTTTAGA GATAAGAGGG TGCTAGCTTG CTAGAACTTA 1020

GAGACAGGTG CTGCACGGCT GTCGTCAGCT CGTGTCGTGA GATGTTGGGT TAAGTCCCGC 1080

AACGAGCGCA ACCCACGTAT TTAGTTGCTA ACGGTTCGGC CGAGCACTCT AAATAGACTG 1140

CCTTCGTAAG GAGGAGGAAG GTGTGGACGA CGTCAAGTCA TCATGGCCCT TATGCCCAGG 1200

GCGACACACG TGCTACAATG GCATATAGAA TGAGACGCAA TACCGCGAGG TGGAGCAAAT 1260

CTATAAAATA TGTCCCAGTT CGGATTGTTC TCTGCAACTC GAGAGCATGA AGCCGGAATC 1320

GCTAGTAATC GTAGATCAGC CATGCTACGG TGAATACGTT CCCGGGTCTT GTACTCACCG 1380

CCCGTCACAC CATGGGAGTT GATTTCACTC GAAGCCGGAA TACTAAACTA GTTACCGTCC 1440 

ACAGTGGAAT CAGCGACTGG GGTGAAGTCG TAACAAGGTA ACCGTAGGAG AACCTGCGGT 1500 

TGGATCACCT CCT 1513 
(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1555 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 



TTTTATGGAG AGTTTGATCC TGGCTCAGGA TGAACGCTGG CGGCGTGCCT AATACATGCA 60

AGTCGAGCGA ACGGACGAGA AGCTTGCTTC TCTGATGTTA GCGGCGGACG GGTGAGTAAC 120

ACGTGGATAA CCTACCTATA AGACTGGGAT AACTTCGGGA AACCGGAGCT AATACCGGAT 180

AATATTTTGA ACCGCATGGT TCAAAAGTGA AAGACGGTCT TGCTGTCACT TATAGATGGA 240

TCCGCGCTGC ATTAGCTAGT TGGTAAGGTA ACGGCTTACC AAGGCAACGA TACGTAGCCG 300

ACCTGAGAGG GTGATCGGCC ACACTGGAAC TGAGACACGG TCCAGACTCC TACGGGAGGC 360

AGCAGTAGGG AATCTTCCGC AATGGGCGAA AGCCTGACGG AGCAACGCCG CGTGAGTGAT 420

GAAGGTCTTC GGATCGTAAA ACTCTGTTAT TAGGGAAGAA CATATGTGTA AGTAACTGTG 480

CACATCTTGA CGGTACCTAA TCAGAAAGCC ACGGCTAACT ACGTGCCAGC AGCCGCGGTA 540

ATACGTAGGT GGCAAGCGTT ATCCGGAATT ATTGGGCGTA AAGCGCGCGT AGGCGGTTTT 600

TTAAGTCTGA TGTGAAAGCC CACGGCTCAA CCGTGGAGGG TCATTGGAAA CTGGAAAACT 660

TGAGTGCAGA AGAGGAAAGT GGAATTCCAT GTGTAGCGGT GAAATGCGCA GAGATATGGA 720

GGAACACCAG TGGCGAAGGC GACTTTCTGG TCTGTAACTG ACGCTGATGT GCGAAAGCGT 780

GGGGATCAAA CAGGATTAGA TACCCTGGTA GTCCACGCCG TAAACGATGA GTGCTAAGTG 840

TTAGGGGGTT TCCGCCCCTT AGTGCTGCAG CTAACGCATT AAGCACTCCG CCTGGGGAGT 900

ACGACCGCAA GGTTGAAACT CAAAGGAATT GACGGGGACC CGCACAAGCG GTGGAGCATG 960

TGGTTTAATT CGAAGCAACG CGAAGAACCT TACCAAATCT TGACATCCTT TGACAACTCT 1020

AGAGATAGAG CCTTCCCCTT CGGGGGACAA AGTGACAGGT GGTGCATGGT TGTCGTCAGC 1080

TCGTGTCGTG AGATGTTGGG TTAAGTCCCG CAACGAGCGC AACCCTTAAG CTTAGTTGCC 1140

ATCATTAAGT TGGGCACTCT AAGTTGACTG CCGGTGACAA ACCGGAGGAA GGTGGGGATG 1200

ACGTCAAATC ATCATGCCCC TTATGATTTG GGCTACACAC GTGCTACAAT GGACAATACA 1260

AAGGGCAGCG AAACCGCGAG GTCAAGCAAA TCCCATAAAG TTGTTCTCAG TTCGGATTGT 1320

AGTCTGCAAC TCGACTACAT GAAGCTGGAA TCGCTAGTAA TCGTAGATCA GCATGCTACG 1380 

GTGAATACGT TCCCGGGTAT TGTACACACC GCCCGTCACA CCACGAGAGT TTGTAACACC 1440

CGAAGCCGGT GGAGTAACCT TTTAGGAGCT AGCCGTCGAA GGTGGGACAA ATGATTGGGG 1500 

TGAAGTCGTA ACAAGGTAGC CGTATCGGAA GGTGCGGCTG GATCACCTCC TTTCT 1555 



(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: RNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 
GAAUACUCAA GCUUGCAUGC CUGCAGGUCG ACUCUAGAGG AUCCCC 46 
(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:162: 

TTGACAATTA ATCATCGGCT CGTATAATGT GTGGAATTGT GAGCGGATAA CAATTTCACA 60 

CAGGAAACAG CGATGAATTC GAGCTCGGTA CCCGGGGATC CTCTAGAGTC GACCTGCAGG 120 

CATGCAAGCT TGGCACTGGC C 141 



(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:163: 

AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG AGACCACAAC GGTTTCCCTC 60 

TAGAAATAAT TTTGTTTAAC TTTAAGAAGG AGATATACAT ATGGCTAGCA TGACTGGTGG 120

ACAGCAAATG GGTCGGATCC GGCT 144
(2) INFORMATION FOR SEQ ID NO: 164:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:164: 
TAATACGACT CACTATAGGG AGACCGGAAT TCGAATTCCG TGTATTCTAT AGTGTCACCT 60 
AAATCGAATT C 71 
(2) INFORMATION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 228 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165:

CGCCAGGGTT TTCCCAGTCA CGACGTTGTA AAACGACGGC CAGTGAATTG TAATACGACT 60

CACTATAGGG CGAATTCGAG CTCGGTACCC GGGGATCCTC TAGAGTCGAC CTGCAGGCAT 120

GCAAGCTTGA GTATTCTATA GTGTCACCTA AATAGCTTGG CGTAATCATG GTCATAGCTG 180

TTTCCTGTGT GAAATTGTTA TCCGCTCACA ATTCCACACA ACATACGA 228